FINITE SIZE SCALING FOR THE CORE OF LARGE

The (two) core of a hypergraph is the maximal collection of hyperedges
within which no vertex appears only once. It is of importance
in tasks such as efficiently solving a large linear system over GF[2], 
or iterative decoding of low-density parity-check codes used over the 
binary erasure channel. Similar structures emerge in a variety of NP- 
hard combinatorial optimization and decision problems, from vertex 
cover to satisfiability. 

For a uniformly chosen random hypergraph of $m = n\rho$ vertices
and $n$ hyperedges, each consisting of the same fixed number $l \ge 3$ of
vertices, the size of the core exhibits for large $n$ a first-order phase
transition, changing from $o(n)$ for $\rho > \rho_c$ to a positive fraction
of $n$ for $\rho < \rho_c$, with a transition window size $\Theta(n^{-1/2})$
around $\rho_c > 0$. Analyzing the corresponding "leaf removal" algorithm,
we determine the associated finite-size scaling behavior. In particular, if
$\rho$ is inside the scaling window (more precisely,
$\rho = \rho_c + r n^{-1/2}$), the probability of having a core of size
$\Theta(n)$ has a limit strictly between 0 and 1, and a leading correction
of order $\Theta(n^{-1/6})$. The correction admits a sharp characterization
in terms of the distribution of a Brownian motion with quadratic shift, from
which it inherits the scaling with $n$. This behavior is expected to be
universal for a wide collection of combinatorial problems.

1. Introduction. The $k$-core of a nondirected graph $G$ is the unique
subgraph obtained by recursively removing all vertices of degree less than
$k$. In particular, the 2-core, hereafter called the core of $G$, is the
maximal collection of edges having no vertex appearing in only one of them.
With an abuse of language we shall use the same term for the induced
subgraph. The core of a hypergraph is analogously defined and plays an
important role in the analysis of many combinatorial problems.

In the first of such applications, Karp and Sipser [24] (hereafter KS)
considered the problem of finding the largest possible matching (i.e.,
vertex disjoint set of edges) in a graph $G$. They proposed an algorithm
that recursively selects an edge $(i,j) \in G$ for which the vertex $i$ has
degree 1. If no such edge exists, the algorithm declares a failure.
Otherwise it includes it in the matching and removes it from the graph
together with all the edges incident on $j$ (that cannot belong to the
matching). Whenever the algorithm successfully matches all vertices, the
resulting matching can be proved to have maximal size. KS analyze the
performance of such an algorithm on uniformly random graphs with $N$
vertices and $M = \lfloor Nc/2 \rfloor$ edges as $N \to \infty$, using the
ODE asymptotic approximation for random processes, based on [25] (cf.
[2, 16] for recent contributions).

It is easy to realize that the algorithm is successful if and only if a
properly constructed hypergraph $\tilde{G}$ does not contain a core. The
hypergraph $\tilde{G}$ includes a node $\tilde{e}$ for each edge $e$ in $G$,
and a hyperedge $\tilde{i}$ for each vertex $i$ of degree 2 or more in $G$.
The hyperedge $\tilde{i}$ is incident on $\tilde{e}$ in $\tilde{G}$ if and
only if $e$ is incident on $i$ in $G$.
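
The leaf-selection rule described above for the KS algorithm can be
sketched as follows. This is only an illustration under simplifying
assumptions (a simple graph given as a list of vertex pairs, and the first
available degree-1 vertex picked at each step); the function name
`karp_sipser_leaf_phase` is hypothetical and not taken from [24].

```python
from collections import defaultdict


def karp_sipser_leaf_phase(edges):
    """Greedy leaf-removal matching on a simple graph.

    Returns the list of matched edges, or None if at some point edges
    remain but no vertex of degree 1 exists (the "failure" case).
    """
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    matching = []
    while any(adj[v] for v in adj):
        # Pick some vertex i of degree 1 (first one found; an assumption).
        leaf = next((v for v in adj if len(adj[v]) == 1), None)
        if leaf is None:
            return None
        (j,) = adj[leaf]              # the unique neighbour of the leaf
        matching.append((leaf, j))
        # Remove every edge incident on j: none of them can be matched.
        for w in list(adj[j]):
            adj[w].discard(j)
        adj[j].clear()
    return matching
```

On a path with four vertices the sketch matches both end edges, while on a
triangle it reports a failure because no vertex has degree 1.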

A more recent application is related to the XOR-SAT problem, a simplified
version of satisfiability introduced in [11]. One is given a linear system
over $m$ binary variables, composed of $n$ equations modulo 2, each
involving exactly $l \ge 3$ variables. The authors of [12, 29] propose a
simple "leaf removal" algorithm to solve such a linear system. The
algorithm recursively selects a variable that appears in a single equation,
and eliminates the corresponding equation from the system. In fact, such an
equation can be eventually satisfied by properly setting the selected
variable. If all the equations are removed by this procedure, a solution
can be constructed by running the process backward and fixing along the way
the selected variables.

A hypergraph G is associated to the linear system by including a vertex 
for each variable, and a hyperedge for each equation. Hyperedge e is incident 
on vertex i if and only if the corresponding equation involves the ith variable 
with nonzero coefficient. It is easy to realize that the leaf removal algorithm 
is successful if and only if the corresponding hypergraph G does not contain 
a core. 
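
As an illustration of the leaf removal procedure just described, the
following sketch computes the core of a hypergraph given as a list of
hyperedges (each a tuple of vertex labels, possibly with repetitions). The
function name `hypergraph_core` and the data layout are assumptions made
for this example, not notation from [12, 29].

```python
from collections import defaultdict


def hypergraph_core(hyperedges):
    """Return the set of indices of hyperedges that survive leaf removal."""
    alive = set(range(len(hyperedges)))
    incident = defaultdict(set)   # vertex -> alive hyperedges containing it
    count = defaultdict(int)      # vertex -> appearances, with multiplicity
    for e, verts in enumerate(hyperedges):
        for v in verts:
            incident[v].add(e)
            count[v] += 1
    stack = [v for v in count if count[v] == 1]
    while stack:
        v = stack.pop()
        if count[v] != 1:         # stale entry: v is no longer a leaf
            continue
        (e,) = incident[v]        # the single hyperedge containing v
        alive.discard(e)
        for w in hyperedges[e]:
            count[w] -= 1
            incident[w].discard(e)
            if count[w] == 1:
                stack.append(w)
    return alive
```

The leaf removal algorithm succeeds exactly when the returned set is empty.
For instance, two hyperedges sharing a single vertex are peeled away
completely, while the four triples on four vertices form a core.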

Uniformly random linear systems with $n$ equations and $m = \rho n$
variables are considered in [12, 29]. It is proved there that the algorithm
is successful with high probability if $\rho$ is larger than a critical
value $\rho_c$, and fails with high probability if $\rho < \rho_c$. See
Figure 1 for an illustration of this phenomenon. Further, it is shown there
that the structure of the set of solutions of the linear system changes
dramatically at $\rho_c$, exhibiting a "clustering effect" when
$\rho < \rho_c$.

Fig. 1. Probability that a random $l = 3$ hypergraph with $m$ vertices and
$n = m/\rho$ hyperedges has a nonempty 2-core, estimated numerically for
$m = 100, \ldots, 600$. The vertical line corresponds to the asymptotic
threshold $\rho_c \approx 1.2218$.



The same "solution clustering" phenomenon has been conjectured for a
variety of random combinatorial decision problems, on the basis of
nonrigorous statistical mechanics calculations. The most studied among
these problems is the random $K$-satisfiability, for which some indication
of clustering is rigorously proved in [5, 15]. Several authors suggest that
the solution clustering phenomenon is related to the poor performance of
search algorithms on properly chosen ensembles of random instances. Still
within random $K$-satisfiability, the performance of certain standard
solution heuristics (such as the "pure-literal" rule) is also related to
the appearance of properly defined cores (see [28]).

We conclude with the application to the analysis of low-density
parity-check code ensembles, used for communication over the binary erasure
channel, which is the most relevant motivation for our work. The decoding
of a noisy message amounts in this case to finding the unique solution of a
linear system over GF[2] (the solution exists by construction, but is not
necessarily unique, in which case decoding fails). If the linear system
includes an equation with only one variable, we thus determine the value of
this variable, and substitute it throughout the system. Repeated
recursively, this procedure either determines all the variables, thus
yielding the unique solution of the system, or halts on a linear subsystem
each of whose equations involves at least two variables. While such an
algorithm is not optimal (when it halts, the resulting linear subsystem
might still have a unique solution), it is the simplest instance of the
widely used belief propagation decoding strategy, which has proved
extremely successful. For example, on properly optimized code ensembles,
this algorithm has been shown to achieve the theoretical limits for
reliable communication, that is, Shannon's channel capacity (see [26]).
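
The decoding procedure described above can be sketched as follows. The
representation is an assumption made for illustration: each parity-check
equation is a list of distinct variable indices whose sum must vanish
modulo 2, and erased symbols are marked with `None`. The name
`peel_decode` is hypothetical.

```python
def peel_decode(checks, received):
    """Iteratively resolve erasures over the binary erasure channel.

    checks:   list of lists of variable indices; each check has parity 0.
    received: list of 0/1 values, with None marking an erased symbol.
    Returns (word, stuck), where stuck lists the checks that still involve
    at least two unknown variables when the procedure halts.
    """
    word = list(received)
    progress = True
    while progress:
        progress = False
        for check in checks:
            unknown = [v for v in check if word[v] is None]
            if len(unknown) == 1:
                # The equation has a single unknown: solve for it.
                parity = 0
                for v in check:
                    if word[v] is not None:
                        parity ^= word[v]
                word[unknown[0]] = parity
                progress = True
    stuck = [c for c in checks if sum(word[v] is None for v in c) >= 2]
    return word, stuck
```

Decoding succeeds when `stuck` is empty and no `None` remains; otherwise
the procedure has halted on a subsystem in which every equation involves at
least two unknowns.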

Once again, one can construct a hypergraph G by associating a hyperedge 
to each variable, and a vertex to each equation (notice that this represen- 
tation is "dual" with respect to the one used for XOR-SAT). Decoding is 
successful (it finds the unique solution) if and only if this hypergraph does 
not contain a core. For a "reasonable" code ensemble the probability of this 
event approaches 1 (resp. 0) when the noise level is smaller (larger) than a 
certain critical value. See [26] for an explicit characterization of the critical 
noise value via an application of the ODE method (based again on [25, 31]). 
Though this result has been successfully used for code design, it is often
a poor approximation for the moderate code block-length (say, $n = 10^2$ to
$10^5$) that is relevant in practice.

To overcome this problem, a finite-size scaling law is derived in [3], pro- 
viding the probability of successfully decoding in the double limit of large 
size n, and noise level approaching the critical value. In [3] the authors also 
conjecture a "refined" law that describes how the finite-size scaling limit is 
approached, and demonstrate empirically that this refined scaling formula 
is very accurate already for short message lengths n ~ 100, opening the way 
to an efficient code design procedure (cf. [4]). 

In this paper we resolve the conjecture by rigorously proving the refined 
scaling law. To simplify the exposition we focus on a specific choice of the 
random ensemble of equations, but our proof generalizes without much diffi- 
culty to a large variety of other cases, and in particular to all those mentioned 
in the conjecture of [3]. (In the coding language, the example we consider 
corresponds to LDPC ensembles with regular left and Poisson right degree; 
it also coincides with the random XOR-SAT ensemble introduced in [11] and 
treated in [12, 29].) In graph-theoretical terms, we determine the
probability that a uniformly random¹ $l$-hypergraph (i.e., a hypergraph
with hyperedges of size $l$) with $n$ hyperedges and $m = n\rho$ vertices
has a nonempty core as $n$ grows and $\rho = \rho(n)$ approaches $\rho_c$.
In the process of establishing the refined scaling law we gain much insight
about the core of such random hypergraphs. For example, we determine the
fluctuations in the size of the core at criticality (see Remark 2.6), and
show that if the hypergraph is built one hyperedge at a time, then its core
size jumps from zero to a positive fraction of $m$ at a random time $n_c$,
the distribution of which we explicitly determine (cf. Remark 2.5).



¹ Indeed, we work with a properly defined "configurational" model (somewhat
similar to the one introduced in [8]) to be defined in the next section.
Our proof strategy should apply without conceptual changes to other phase
transitions within the same class, such as $k$-core percolation on random
graphs (with $k \ge 3$), or the pure-literal rule threshold in random
$k$-SAT (with $k \ge 3$; cf. [17]). Even beyond this family of closely
related phenomena, the form of the refined scaling law (in particular, the
scaling with $n$ of the scaling window and of the first correction) is
likely to be quite universal. For instance, in [3] it has been empirically
found to hold for iterative decoding of LDPC codes over general channels.
Within statistical physics, several core phase transitions have been
studied as special examples of "mean field dynamical glass transition"
[33]. It is possible that the refined finite-size scaling law generalizes
to this (quite large) class as well.

Finite-size scaling has been the object of several investigations in
statistical physics and in combinatorics. Most of these studies estimate
the size of the corresponding scaling window. That is, fixing a small value
of $\varepsilon > 0$, they find the amount of change in some control
parameter which moves the probability of a relevant event from
$\varepsilon$ to $1 - \varepsilon$. A remarkably general result in this
direction is the rigorous formulation of a "Harris criterion" in [10, 35].
Under mild assumptions, this implies that the scaling window has to be at
least $\Omega(n^{-1/2})$ for a properly defined control parameter (e.g.,
the ratio $\rho$ of the number of nodes to hyperedges in our problem). A
more precise result has recently been obtained for the
satisfiable-unsatisfiable phase transition for the random 2-SAT problem,
yielding a window of size $\Theta(n^{-1/3})$ [9]. Note, however, that
statistical physics arguments suggest that the phase transition we consider
here is not from the same universality class as the
satisfiable-unsatisfiable transition for the random 2-SAT problem.

In contrast with the preceding, we provide a much sharper characteriza- 
tion, yielding beyond the scaling window and the limiting scaling function, 
also the asymptotic form of corrections to this limit. In this respect, our 
work is closer in its level of precision to that for the scaling behavior in the 
emergence of the giant component in Erdős-Rényi random graphs (for more
on the latter, see [22] and the references therein). 

At the level of degenerate (or zero-one) fluid-limits, the asymptotic size
of the $k$-core of random graphs is determined by [31] via the ODE method. See
also [28] for a general approach for deriving such results without recourse 
to ODE approximations (using instead a method analogous to the "density 
evolution" technique from coding theory). 

Darling and Norris determine in [14] the asymptotic size of the 2-core of 
a random hypergraph which is the "dual" of the model we consider here. 
Indeed, the hyperedges in their model are of random, Poisson distributed, 
sizes, which allows for a particularly simple Markovian description of the 
recursive algorithm that constructs the core. Dealing as we do with random 
hypergraphs at the critical point, where the asymptotic core size exhibits a 
discontinuity, they describe the fluctuations around the deterministic limit
via a certain linear SDE. In doing so, they heavily rely on the powerful
theory of weak convergence, in particular in the context of convergence of
Markov processes. For further results derived along the same line of
reasoning, see [13, 18, 19].

In contrast, as we outline in the next section, the focus of this paper is
on correction terms and rates of convergence. These are beyond the scope
of weak convergence theory. In the context of our main result, Theorem 2.3,
weak convergence arguments only provide the limit, as $n \to \infty$ and
$\rho_n$ near its critical value, of the probability that a uniformly
chosen random hypergraph with $n$ hyperedges over $n\rho_n$ vertices has a
nonempty 2-core.

The need to estimate correction terms is why many steps in our proof
involve delicate coupling arguments, expanding and keeping track of the
rate of decay of approximation errors (in terms of $n$). Our technique can
be extended to provide rates of convergence (in the sup-norm) as $n$ grows,
for distributions of inhomogeneous Markov chains on $\mathbb{R}^d$ whose
transition kernels $W_{\tau,n}(x_{\tau+1} - x_\tau = y \mid x_\tau = x)$
are approximately (in $n$) linear in $x$, and "strongly elliptic" of
uniformly bounded support with respect to $y$.

2. Main result and outline of proof. We consider hypergraphs with $n$
hyperedges over $m = \lfloor n\rho \rfloor$ vertices, $\rho > 0$. Each
hyperedge is an ordered list of $l \ge 3$, not necessarily distinct
vertices chosen independently and uniformly at random with replacement. We
are interested in the probability $P_l(n,\rho)$ that a random hypergraph
from this ensemble has a nonempty 2-core (i.e., the existence of a nonempty
list of hyperedges such that, if a vertex appears in this list, then it
does so at least twice).

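In the next section we construct an inhomogeneous Markov chain
$\{z(\tau) = (z_1(\tau), z_2(\tau)),\ n \ge \tau \ge 0\}$, where
$z_1(\tau)$ and $z_2(\tau)$ keep track, respectively, of the number of
vertices of degree 1 and of degree at least 2 after $\tau$ steps of the
decimation algorithm. As we show in Section 5, in the large $n$ limit, this
chain is well approximated by a simpler chain with transition probabilities

(2.1)  $\widehat{\mathbb{P}}_{n,\rho}\{z(\tau+1) = z + (q_1 - q_0, -q_1) \mid z(\tau) = z\} = \binom{l-1}{q_0 - 1,\ q_1,\ q_2}\, \mathfrak{p}_0^{q_0 - 1}\, \mathfrak{p}_1^{q_1}\, \mathfrak{p}_2^{q_2}.$

For $x = z/n$, $\theta = \tau/n$,

(2.2)  $\mathfrak{p}_0 = \frac{x_1}{l(1-\theta)}, \qquad \mathfrak{p}_1 = \frac{x_2 \lambda^2}{l(1-\theta)(e^{\lambda} - 1 - \lambda)}, \qquad \mathfrak{p}_2 = \frac{x_2 \lambda}{l(1-\theta)},$

where for $x_2 > 0$, we set $\lambda$ as the unique positive solution of

(2.3)  $\frac{\lambda(e^{\lambda} - 1)}{e^{\lambda} - 1 - \lambda} = \frac{l(1-\theta) - x_1}{x_2},$

enforcing $\mathfrak{p}_0 + \mathfrak{p}_1 + \mathfrak{p}_2 = 1$, while for
$x_2 = 0$, we instead set by continuity $\mathfrak{p}_1 = 0$ and
$\mathfrak{p}_2 = 1 - \mathfrak{p}_0$.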

Further, we show in Lemma 4.4 that $n^{-1} z(0)$ converges to the
nonrandom vector

(2.4)  $y(0) = (l e^{-l/\rho},\ \rho(1 - e^{-l/\rho}) - l e^{-l/\rho}).$

Since the chain (2.1) has bounded increments, and the corresponding
probabilities depend on the state only through the macroscopic variables
$x$ and $\theta$, it is not hard to verify that the scaled process
$n^{-1} z(\theta n)$ concentrates around the solution of the ODE

(2.5)  $\frac{dy}{d\theta}(\theta) = F(y(\theta), \theta),$

where $F(x,\theta) = (-1 + (l-1)(\mathfrak{p}_1 - \mathfrak{p}_0),\ -(l-1)\mathfrak{p}_1)$
is the mean of $z(\tau+1) - z(\tau)$ under the transitions of (2.1); see,
for instance, [12, 26, 29]. The solution of this ODE will be denoted by
$y(\theta, \rho)$, often using the shorthand $y(\theta)$ (where the fixed
value of $\rho$ is clear from the context). From the solution, one finds
that $y_1(\theta)$ remains strictly positive for all $\theta \in [0,1)$ if
and only if $\rho > \rho_c$ [see (4.3)]. As shown in [26] this indicates
that, with high probability, the algorithm successfully decimates the whole
hypergraph without ever running out of degree 1 vertices if
$\rho > \rho_c$. Vice versa, for $\rho < \rho_c$, the solution $y(\theta)$
crosses the $y_1 = 0$ plane; this is shown to imply that the algorithm
stops and returns a large core with high probability. In the critical case
$\rho = \rho_c$, the solution $y(\theta)$ touches the $y_1 = 0$ plane at
the unique time $\theta = \theta_c \in (0, 1)$ (see Proposition 4.2). The
principal conclusion of Section 5 is that, near criticality, $P_l(n, \rho)$
can be estimated by the probability that $z(\tau)$ is small in a
neighborhood of $\tau = n\theta_c$. More precisely:

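Proposition 2.1. Let $\beta \in (3/4, 1)$,
$J_n = [n\theta_c - n^{\beta}, n\theta_c + n^{\beta}]$ and
$|\rho - \rho_c| \le n^{\beta' - 1}$ with $\beta' < 2\beta - 1$. Then for
$\varepsilon_n = A \log n$,

(2.6)  $\widehat{\mathbb{P}}_{n,\rho}\Big\{ \inf_{\tau \in J_n} z_1(\tau) \le -\varepsilon_n \Big\} - \delta_n \le P_l(n,\rho) \le \widehat{\mathbb{P}}_{n,\rho}\Big\{ \inf_{\tau \in J_n} z_1(\tau) \le \varepsilon_n \Big\} + \delta_n,$

where $\delta_n = D n^{-1/2} (\log n)^2$.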
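
The fluid limit (2.5) that underlies this proposition can be explored
numerically. The sketch below is not part of the proof. It assumes
transition probabilities of the form
$\mathfrak{p}_0 = x_1/(l(1-\theta))$,
$\mathfrak{p}_1 = x_2\lambda^2/(l(1-\theta)(e^\lambda - 1 - \lambda))$,
$\mathfrak{p}_2 = x_2\lambda/(l(1-\theta))$, with $\lambda$ fixed by
requiring that the three sum to one, and integrates
$dy/d\theta = F(y,\theta)$ by a standard Runge-Kutta scheme. All function
names are hypothetical, and the bisection bounds and step count are
arbitrary choices.

```python
import math


def initial_state(l, rho):
    """Initial condition y(0) of (2.4)."""
    g = l / rho
    return (l * math.exp(-g), rho * (1.0 - math.exp(-g)) - l * math.exp(-g))


def mean_ratio(lam):
    """lam (e^lam - 1) / (e^lam - 1 - lam); increasing, tends to 2 at 0."""
    if lam < 1e-4:
        return 2.0 + lam / 3.0          # small-lam series, avoids cancellation
    if lam > 50.0:
        return lam                      # e^lam dominates
    em1 = math.expm1(lam)
    return lam * em1 / (em1 - lam)


def solve_lambda(target):
    """Bisection for mean_ratio(lam) = target; 0.0 when target <= 2."""
    if target <= 2.0:
        return 0.0
    lo, hi = 0.0, max(60.0, 2.0 * target)
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if mean_ratio(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def transition_probs(x, theta, l):
    """(p0, p1, p2) as functions of the rescaled state x and time theta."""
    x1, x2 = x
    edges = l * (1.0 - theta)
    p0 = x1 / edges
    if x2 <= 0.0:
        return p0, 0.0, 1.0 - p0
    lam = solve_lambda((edges - x1) / x2)
    if lam < 1e-6:
        p1 = 2.0 * x2 / edges           # limit of p1 as lam -> 0
    else:
        p1 = x2 * lam * lam / (edges * (math.expm1(lam) - lam))
    return p0, p1, x2 * lam / edges


def drift(x, theta, l):
    """Mean increment F(x, theta) of the chain."""
    p0, p1, _ = transition_probs(x, theta, l)
    return (-1.0 + (l - 1) * (p1 - p0), -(l - 1) * p1)


def integrate(l, rho, theta_end, steps=2000):
    """RK4 integration of dy/dtheta = F; stops early once y_1 < 0."""
    h = theta_end / steps
    theta, y = 0.0, initial_state(l, rho)
    for _ in range(steps):
        k1 = drift(y, theta, l)
        k2 = drift((y[0] + 0.5 * h * k1[0], y[1] + 0.5 * h * k1[1]), theta + 0.5 * h, l)
        k3 = drift((y[0] + 0.5 * h * k2[0], y[1] + 0.5 * h * k2[1]), theta + 0.5 * h, l)
        k4 = drift((y[0] + h * k3[0], y[1] + h * k3[1]), theta + h, l)
        y = (y[0] + h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0,
             y[1] + h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0)
        theta += h
        if y[0] < 0.0:
            break
    return theta, y
```

For $l = 3$, calling `integrate(3, 2.0, 0.5)` keeps $y_1$ positive, while
`integrate(3, 1.0, 0.5)` stops early with $y_1 < 0$, in line with the
threshold $\rho_c \approx 1.2218$ quoted in Figure 1.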

At the critical point (i.e., for $\rho = \rho_c$ and $\theta = \theta_c$)
the solution of the ODE (2.5) is tangent to the $y_1 = 0$ plane and
fluctuations in the $y_1$ direction determine whether a large core exists
or not. Further, in a neighborhood of $\theta_c$, we have
$y_1(\theta) \simeq \frac{1}{2}\tilde{F}(\theta - \theta_c)^2$, for some
$\tilde{F} > 0$. In the same neighborhood, the contribution of fluctuations
to the change of $z_1$ is approximately
$\sqrt{\tilde{G}\, n\, |\theta - \theta_c|}$, with $\tilde{G} > 0$.
Comparing these two contributions (both are of order $n^{1/3}$ when
$|\theta - \theta_c|$ is of order $n^{-1/3}$) we see that the relevant
scaling is
$X_n(t) = n^{-1/3}[z_1(n\theta_c + n^{2/3} t) - z_1(n\theta_c)]$, which for
large $n$ converges, by strong approximation, to
$X(t) = \frac{1}{2}\tilde{F} t^2 + \sqrt{\tilde{G}}\, W(t)$, for a standard
two-sided Brownian motion $W(t)$ (see Lemma 6.1 for a precise quantitative
statement). Clearly,

(2.7)  $\tilde{F} = \frac{d^2 y_1}{d\theta^2} = \frac{\partial F_1}{\partial \theta} + \frac{\partial F_1}{\partial y_2} F_2, \qquad \tilde{G} = G_{11}.$

In the last expression we adopted the convention (to be followed hereafter)
of omitting the arguments whenever they refer to the critical point
$\theta = \theta_c$, $y = y(\theta_c)$ and the trajectory considered is the
critical one, that is, $\rho = \rho_c$.

Fluctuations of $z(n\theta_c)$ around $n y(\theta_c)$ are accumulated in
$n\theta_c$ stochastic steps, and are therefore of order $\sqrt{n}$. As
shown in Section 6, the rescaled variable
$(z(n\theta) - n y(\theta))/\sqrt{n}$ converges to a Gaussian random
variable. Its covariance matrix
$Q(\theta,\rho) = \{Q_{ab}(\theta,\rho);\ 1 \le a, b \le 2\}$ is the
symmetric positive definite solution of the ODE:

(2.8)  $\frac{dQ}{d\theta}(\theta) = G(y(\theta),\theta) + A(y(\theta),\theta)\, Q(\theta) + Q(\theta)\, A(y(\theta),\theta)^{\dagger},$

where $A(x,\theta) = \{A_{ab}(x,\theta);\ 1 \le a, b \le 2\}$ for
$A_{ab}(x,\theta) = \partial_{x_b} F_a(x,\theta)$, and $G(x,\theta)$ is the
covariance of $z(\tau+1) - z(\tau)$ under the transitions (2.1), that is,
the nonnegative definite symmetric matrix with entries

(2.9)  $G_{11}(x,\theta) = (l-1)[\mathfrak{p}_0 + \mathfrak{p}_1 - (\mathfrak{p}_0 - \mathfrak{p}_1)^2],$
       $G_{12}(x,\theta) = -(l-1)[\mathfrak{p}_0 \mathfrak{p}_1 + \mathfrak{p}_1(1 - \mathfrak{p}_1)],$
       $G_{22}(x,\theta) = (l-1)\,\mathfrak{p}_1(1 - \mathfrak{p}_1).$

Here again we use the convention $Q(\theta) = Q(\theta, \rho)$ when the
value of $\rho$ is clear from the context. The positive definite initial
condition $Q(0)$ for (2.8) is computed on the original graph ensemble, and
given by

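(2.10)  $Q_{11}(0) = \frac{l}{\gamma}\, \gamma e^{-2\gamma}(e^{\gamma} - 1 + \gamma - \gamma^2),$
        $Q_{12}(0) = -\frac{l}{\gamma}\, \gamma e^{-2\gamma}(e^{\gamma} - 1 - \gamma^2),$
        $Q_{22}(0) = \frac{l}{\gamma}\, e^{-2\gamma}[(e^{\gamma} - 1) + \gamma(e^{\gamma} - 2) - \gamma^2(1 + \gamma)],$

where $\gamma = l/\rho$ (see Section 4.2 for details).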
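
As a quick numerical companion to the initial condition (2.10), the sketch
below evaluates the three entries for given $l$ and $\rho$, assuming the
formulas read $Q_{11}(0) = (l/\gamma)\gamma e^{-2\gamma}(e^\gamma - 1 +
\gamma - \gamma^2)$, $Q_{12}(0) = -(l/\gamma)\gamma e^{-2\gamma}(e^\gamma -
1 - \gamma^2)$ and $Q_{22}(0) = (l/\gamma)e^{-2\gamma}[(e^\gamma - 1) +
\gamma(e^\gamma - 2) - \gamma^2(1+\gamma)]$ with $\gamma = l/\rho$. The
function name `initial_covariance` is hypothetical.

```python
import math


def initial_covariance(l, rho):
    """Q(0) as a 2x2 nested tuple, with gamma = l / rho."""
    g = l / rho
    e2 = math.exp(-2.0 * g)
    eg = math.exp(g)
    q11 = (l / g) * g * e2 * (eg - 1.0 + g - g * g)
    q12 = -(l / g) * g * e2 * (eg - 1.0 - g * g)
    q22 = (l / g) * e2 * ((eg - 1.0) + g * (eg - 2.0) - g * g * (1.0 + g))
    return ((q11, q12), (q12, q22))
```

A positive determinant together with positive diagonal entries gives a
simple sanity check of positive definiteness for a chosen pair $(l, \rho)$.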

The parameter describing the fluctuations of
$z_1(n\theta) - z_1(n\theta_c)$ for $\theta$ near $\theta_c$ is simply
$\tilde{G} = G_{11}(y(\theta_c),\theta_c)$. As we show in Section 6, this
analysis allows us to approximate the probability that $z(\tau)$ approaches
the $z_1 = 0$ plane, by replacing $\{z(\tau)\}$ by an appropriately
constructed Gaussian process.

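Proposition 2.2. Let
$X(t) = \frac{1}{2}\tilde{F} t^2 + \sqrt{\tilde{G}}\, W(t)$ where $W(t)$ is
a doubly infinite standard Brownian motion conditioned to $W(0) = 0$.
Further, let $\xi(r)$ be a normal random variable of mean
$(\frac{\partial y_1}{\partial \rho})\, r$ and variance $Q_{11}$ (both
evaluated at $\theta = \theta_c$ and $\rho = \rho_c$), which is independent
of $W(t)$.

For some $\beta \in (3/4, 1)$, any $\eta < 5/26$, all $A > 0$,
$r \in \mathbb{R}$ and $n$ large enough, if $\rho_n = \rho_c + r n^{-1/2}$
and $\varepsilon_n = A \log n$, then

(2.11)  $\Big| \widehat{\mathbb{P}}_{n,\rho_n}\Big\{ \inf_{\tau \in J_n} z_1(\tau) \le \pm\varepsilon_n \Big\} - \mathbb{P}\Big\{ n^{1/6}\xi(r) + \inf_t X(t) \le 0 \Big\} \Big| \le n^{-\eta}.$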

We note in passing that within the scope of weak convergence, Aldous
[1] pioneered the use of Brownian motion with quadratic drift [à la $X(t)$
of Proposition 2.2], to examine the near-critical behavior of the giant
component in Erdős-Rényi random graphs, and his method was extended by
Goldschmidt [19] to the giant set of identifiable vertices in Poisson random
hypergraph models.

The key to the validity of Proposition 2.2 at the $o(n^{-1/6})$ level of
accuracy relevant here, is the fact that within the critical time window
$J_n$ the Markov chain of transition probabilities (2.1) is well
approximated by the chain

(2.12)  $z'(\tau+1) = z'(\tau) + \tilde{A}_\tau\,(n^{-1} z'(\tau) - y(\tau/n)) + \Delta_\tau,$

with $\tilde{A}_\tau = \mathbb{1}_{\tau < \tau_n}\, A(y(\tau/n, \rho), \tau/n)$
for $\tau_n = \lfloor n\theta_c - n^{\beta} \rfloor$, and independent
random variables $\{\Delta_\tau\}$ of mean $F(y(\tau/n), \tau/n)$ and
covariance $G(y(\tau/n), \tau/n)$ (cf. Proposition 5.5). In particular,
taking

(2.13)  $\tilde{B}_\sigma^\tau = \Big(I + \frac{1}{n}\tilde{A}_\tau\Big)\Big(I + \frac{1}{n}\tilde{A}_{\tau-1}\Big) \cdots \Big(I + \frac{1}{n}\tilde{A}_\sigma\Big),$

for integers $0 \le \sigma \le \tau$ (while $\tilde{B}_\sigma^\tau = I$ in
case $\tau < \sigma$), we see that

(2.14)  $z'(\tau) = \tilde{B}_0^{\tau-1} z'(0) + \sum_{\sigma=0}^{\tau-1} \tilde{B}_{\sigma+1}^{\tau-1}\,(\Delta_\sigma - \tilde{A}_\sigma\, y(\sigma/n))$

is a sum of (bounded) independent random variables, hence of approximately
normal distribution. Further, the mean and covariance of $z'(\tau)$ are
given by discretized versions of (2.5) and (2.8), hence are sufficiently
close to the solutions $y(\theta,\rho)$ and $Q(\theta,\rho)$ of these ODEs
(cf. Lemma 4.3).

Combining Propositions 2.1 and 2.2, we are now able to estimate the
desired probability $P_l(n,\rho)$ in terms of the distribution of the global
minimum of the process $\{X(t)\}$ (i.e., a Brownian motion plus a quadratic
shift).
The latter has been determined already in [20], yielding the following con- 
clusion, which is our main result. Figure 2 illustrates the accuracy of the 
finite-size scaling expression proved below, by comparing it with numerical 
simulations. 






A. DEMBO AND A. MONTANARI 














Fig. 2. The numerical estimates for the core probabilities in Figure 1,
plotted versus scaling variables $r_1$, $r_2$. On the left:
$r_1 = \sqrt{n}(\rho - \rho_c)/\alpha_l$. On the right:
$r_2 = \sqrt{n}(\rho - \rho_c - \delta_l n^{-2/3})/\alpha_l$ where
$\delta_l = \alpha_l \beta_l \Omega$. According to Theorem 2.3, corrections
to the asymptotic curve $\Phi(-r)$ (dashed) are $\Theta(n^{-1/6})$ on the
left, and $O(n^{-5/26+\varepsilon})$ on the right.

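Theorem 2.3. Let $l \ge 3$, and define
$\alpha_l = \sqrt{Q_{11}}\,(\frac{\partial y_1}{\partial \rho})^{-1}$,
$\beta_l = \frac{1}{\sqrt{Q_{11}}}\,\tilde{G}^{2/3}\tilde{F}^{-1/3}$,
$\rho_n = \rho_c + r n^{-1/2}$. Then, for any $\eta < 5/26$,

(2.15)  $P_l(n,\rho_n) = \Phi(-r/\alpha_l) + \beta_l\,\Omega\,\Phi'(-r/\alpha_l)\, n^{-1/6} + O(n^{-\eta}),$

where $\Phi(x)$ denotes the distribution function for a standard normal
random variable, the finite constant $\Omega$ is given by the integral

(2.16)  $\Omega = \int_0^{\infty} [1 - \mathcal{K}(z)^2]\, dz,$

where

(2.17)  $\mathcal{K}(z) = \frac{1}{2} \int_{-\infty}^{\infty} \frac{\mathrm{Ai}(iy)\,\mathrm{Bi}(2^{1/3} z + iy) - \mathrm{Ai}(2^{1/3} z + iy)\,\mathrm{Bi}(iy)}{\mathrm{Ai}(iy)}\, dy,$

and $\mathrm{Ai}(\cdot)$, $\mathrm{Bi}(\cdot)$ are the Airy functions (as
defined in [6], page 446).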
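
To make the structure of the scaling formula (2.15) concrete, the sketch
below evaluates its leading term and first correction for user-supplied
constants. The values of $\rho_c$, $\alpha_l$, $\beta_l$ and $\Omega$ are
inputs here, not computed: any numbers passed in the example are
placeholders chosen for illustration. The function names are hypothetical.

```python
import math


def std_normal_cdf(x):
    """Distribution function of a standard normal variable."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def std_normal_pdf(x):
    """Density of a standard normal variable (the derivative of the cdf)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def core_probability(n, rho, rho_c, alpha, beta, omega):
    """Return (leading term, leading term plus n^{-1/6} correction)."""
    r = math.sqrt(n) * (rho - rho_c)      # scaling variable, rho = rho_c + r n^{-1/2}
    u = -r / alpha
    leading = std_normal_cdf(u)
    correction = beta * omega * std_normal_pdf(u) * n ** (-1.0 / 6.0)
    return leading, leading + correction
```

At $\rho = \rho_c$ the leading term is exactly $1/2$, and the correction
decays only as $n^{-1/6}$, which is why it matters at moderate sizes.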



Remark 2.4. The simulations in Figure 2 suggest that the approximation of
$P_l(n,\rho_n)$ we provide in (2.15) is more accurate than the
$O(n^{-5/26+\varepsilon})$ correction term suggests. Our proof shows that
one cannot hope for a better error estimate than $\Theta(n^{-1/3})$ as we
neglect the second-order term in expanding
$\Phi(-r/\alpha_l + C n^{-1/6})$; see (2.18). We believe this is indeed the
order of the next term in the expansion (2.15). Determining its form is an
open problem.

Remark 2.5. It is of interest to consider the (time) evolution of the
core for the hypergraph process in which one hyperedge is added uniformly
at random at each time step. In other words, $n$ increases with time, while
the number of vertices $m$ is kept fixed. Let $S(n)$ be the corresponding
(random) number of hyperedges in the core of the hypergraph at time $n$ and
$n_c = \min\{n : S(n) \ge 1\}$ the onset of a nonempty core. From Lemma 4.7
we have that for any $\rho > 0$ there exist $\kappa > 0$ and $C < \infty$
such that $\{S(n) : 0 \le n \le m/\rho\}$ intersects $[1, m\kappa]$ with
probability at most $C m^{1 - l/2}$. Further, fixing $\rho < \rho_c$ the
probability of an empty core, that is, $S(m/\rho) = 0$, decays
(exponentially) in $m$. We thus deduce that for large $m$ most of the
trajectories $\{S(n)\}$ jump from having no core to a linear (at least
$m\kappa$) core size at the well-defined (random) critical edge number
$n_c$. By the monotonicity of $S(n)$ we also know that
$\mathbb{P}_m\{n_c \le m/\rho\} = P_l(m/\rho, \rho)$. Therefore, Theorem 2.3
allows us to determine the asymptotic distribution of $n_c$. Indeed,
expressing $n$ in terms of $m$ in (2.15) we get that for each fixed
$x \in \mathbb{R}$,

$\mathbb{P}_m\{n_c \le m\rho_c^{-1} + m^{1/2}\rho_c^{-3/2}\alpha_l x\} = \Phi(x) + \beta_l\,\Omega\,\rho_c^{1/6}\,\Phi'(x)\, m^{-1/6} + O(m^{-\eta}),$

whence we read off that
$\hat{n}_c = (n_c - m/\rho_c)/(\sqrt{m}\,\rho_c^{-3/2}\alpha_l) + \beta_l\,\Omega\,\rho_c^{1/6} m^{-1/6}$
converge in distribution to the standard normal law [and the corresponding
distribution functions converge pointwise to $\Phi(x)$ at a rate which is
faster than $m^{-\eta}$ for any $\eta < 5/26$].

Remark 2.6. Our techniques are applicable to many other properties of the
core in the "scaling regime" $\rho_n = \rho_c + r n^{-1/2}$. For example,
the distribution of the number of hyperedges $S$ in the core can be derived
from the approximation of the trajectory of the decimation algorithm.
Namely, as shown in Section 6, for such $\rho_n$ near the critical time
$z_1(\tau) \simeq \sqrt{n}\,\xi(r) + X_n(\tau)$ for $\xi(r)$ and
$X_n(\tau) = n^{1/3} X(n^{-2/3}(\tau - n\theta_c))$ as in Proposition 2.2.
With $\mathbb{E}X_n(\tau) = \frac{\tilde{F}}{2n}(\tau - n\theta_c)^2$, upon
noting that $n - S = \min\{\tau : z_1(\tau) = 0\}$, we obtain that,
conditional to the existence of a nonempty core,
$(S - n(1 - \theta_c))/n^{3/4}$ converges in distribution to
$(4Q_{11}/\tilde{F}^2)^{1/4} Z$ with $Z$ a nondegenerate random variable.
Indeed, at the relevant time window $n\theta_c \pm O(n^{3/4})$ the
contribution of $X_n(\cdot) - \mathbb{E}X_n(\cdot)$ to the fluctuations of
$S$ is negligible in comparison with that of $\sqrt{n}\,\xi(r)$. So, more
precisely, based on the explicit distribution of $\xi(r)$ we have that
$Z = \sqrt{U - rb}$ for
$b = Q_{11}^{-1/2}\,\frac{\partial y_1}{\partial \rho}$ and $U$ a standard
normal random variable conditioned to $U \ge rb$. In formulas, $Z$ is
supported on $\mathbb{R}_+$ and admits there the probability density

$p_Z(z) = \frac{2z\, e^{-(1/2)(rb + z^2)^2}}{\sqrt{2\pi}\,[1 - \Phi(rb)]}.$

Naively one expects the core size to have $\Theta(n^{1/2})$ fluctuations.
This is indeed the asymptotic behavior for a fixed $\rho < \rho_c$, but as
usual in phase transitions, fluctuations are enhanced near the critical
point.

The distribution of the fractions of vertices with a given degree within 
the core can be computed along the same lines. 
Remark 2.7. As already pointed out, our proof concerns a properly defined
configuration model whereby each edge might include the same vertex
more than once ("self-loop"), and two hyperedges might include the same 
vertices ("double edge"). We expect a result similar to Theorem 2.3 to hold 
for a uniformly random hypergraph, with forbidden self-loops and double 
edges. 

The main difficulty in proving such a generalization would be the absence 
of an explicit representation for the kernel of the leaf removal process. In 
the present case, such an expression is known and provided by Lemma 3.1. 
Within the uniform model, one should resort to graph enumeration formulas 
as the ones in [27]. This would give rise to a new Markov chain that can 
nevertheless be coupled to the one defined in (2.1). The thesis would follow 
by bounding the expected maximum distance between the trajectories of 
the two chains. 

Proof of Theorem 2.3. Putting together Propositions 2.1 and 2.2, we get
that

$P_l(n,\rho_n) = \mathbb{P}\Big\{ n^{1/6}\xi(r) + \inf_t X(t) \le 0 \Big\} + O(n^{-\eta}).$

By Brownian scaling,
$X(t) = \tilde{F}^{-1/3}\tilde{G}^{2/3}\,\tilde{X}(\tilde{F}^{2/3}\tilde{G}^{-1/3} t)$,
where $\tilde{X}(t) = \frac{1}{2}t^2 + \tilde{W}(t)$ and $\tilde{W}(t)$ is
also a two-sided standard Brownian motion. With
$Z = \inf_t \tilde{X}(t)$, and $Y$ a standard normal random variable which
is independent of $\tilde{X}(t)$, we clearly have that

(2.18)  $P_l(n,\rho_n) = \mathbb{P}\Big\{ n^{1/6}\frac{\partial y_1}{\partial \rho}\, r + n^{1/6}\sqrt{Q_{11}}\, Y + \tilde{F}^{-1/3}\tilde{G}^{2/3} Z \le 0 \Big\} + O(n^{-\eta})$
        $= \mathbb{E}\big\{ \Phi(-r/\alpha_l - \beta_l n^{-1/6} Z) \big\} + O(n^{-\eta}).$

The proof of the theorem is thus completed by a first-order Taylor expansion
of $\Phi(\cdot)$ around $-r/\alpha_l$, as soon as we show that
$\mathbb{E}Z = -\Omega$, and $\mathbb{E}|Z|^2$ is finite. To this end, from
[20], Theorem 3.1, we easily deduce that $Z$ has the continuous
distribution function $F_Z(z) = 1 - \mathcal{K}(-z)^2$ for $z < 0$, while
$F_Z(z) = 1$ for $z \ge 0$, resulting after integration by parts with the
explicit formula (2.16) for $\Omega$. We note in passing that taking
$c = 1/2$ and $s = 0$ in [20], (5.2), provides the explicit formula (2.17)
for $\mathcal{K}(x)$, en route to which [20] also proves the finiteness of
the relevant integral. Further, [20], Corollary 3.4, shows that the
probability that the minimum of $\tilde{X}(t)$ is achieved at some
$t \notin [-T, T]$ is at most $A_0^{-1} e^{-A_0 T^3}$ for a positive
constant $A_0$. With $\tilde{X}(t) \ge \tilde{W}(t)$ we therefore have that

$F_Z(z) = \mathbb{P}\{Z \le z\} \le \mathbb{P}\Big\{ \inf_{t \in [-T,T]} \tilde{X}(t) \le z \Big\} + A_0^{-1} e^{-A_0 T^3} \le 4 e^{-z^2/(2T)} + A_0^{-1} e^{-A_0 T^3}.$

Taking $T = \sqrt{|z|}$ we deduce that if $z < 0$, then
$F_Z(z) \le C^{-1} \exp(-C|z|^{3/2})$ for some $C > 0$, which yields the
stated finiteness of each moment of $Z$ (and in particular, of
$\mathbb{E}|Z|^2$ and $\Omega$). □
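
The quantity $\Omega = -\mathbb{E}Z$, with $Z$ the global minimum of
$\frac{1}{2}t^2 + \tilde{W}(t)$ over $t \in \mathbb{R}$, can also be
approximated by simulation. The sketch below is a crude Monte Carlo
estimate under stated assumptions: the path is sampled on a grid of spacing
`dt` over $[-T, T]$ (both truncation and discretization bias the minimum
upward, so $\Omega$ is underestimated), and the function names, sample
count and seed are arbitrary choices made for illustration.

```python
import math
import random


def min_of_drifted_bm(T, dt, rng):
    """Grid minimum over [-T, T] of t^2/2 + W(t), two-sided W, W(0) = 0."""
    steps = int(round(T / dt))
    sd = math.sqrt(dt)
    best = 0.0                          # value of the process at t = 0
    for _ in range(2):                  # independent branches t >= 0 and t <= 0
        w = 0.0
        for k in range(1, steps + 1):
            w += rng.gauss(0.0, sd)     # Brownian increment over one grid step
            t = k * dt
            best = min(best, 0.5 * t * t + w)
    return best


def estimate_omega(samples=2000, T=4.0, dt=0.01, seed=0):
    """Monte Carlo estimate of Omega = -E[min_t (t^2/2 + W(t))]."""
    rng = random.Random(seed)
    total = sum(min_of_drifted_bm(T, dt, rng) for _ in range(samples))
    return -total / samples
```

The tail bound derived in the proof suggests that a moderate $T$ already
captures the minimum with overwhelming probability; refining `dt` is the
more relevant knob for accuracy.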

3. Ensembles and transition probabilities: exact expressions. 

3.1. Model for the (initial) graph. Throughout the paper we follow the
coding literature and identify the hypergraph with a bipartite graph with
two types of nodes: v-nodes, corresponding to hyperedges, and c-nodes,
corresponding to vertices. A graph $G$ in the ensemble
$\mathcal{G} = \mathcal{G}_l(n,m)$ consists of a set of v-nodes $V = [n]$,
a set of c-nodes $C = [m]$ and an ordered list of edges, that is, couples
$(i,a)$ with $i \in V$ and $a \in C$,

$E = [(1,a_1), (1,a_2), \ldots, (1,a_l);\ (2,a_{l+1}), \ldots;\ (n,a_{(n-1)l+1}), \ldots, (n,a_{nl})],$

where a couple $(i,a)$ appears before $(j,b)$ whenever $i < j$ and each
v-node $i$ appears exactly $l$ times in the list, with $l \ge 3$ a fixed
integer parameter. The total number of graphs in this ensemble is thus

(3.1)  $|\mathcal{G}_l(n,m)| = m^{nl} = \mathrm{coeff}[(e^{x})^{m}, x^{nl}]\,(nl)!.$

The ensemble of graphs $\mathcal{G}$ is endowed with the uniform
distribution. One way to sample from this distribution is by considering
the v-nodes in order, $i = 1, \ldots, n$, where for each v-node and for
$j = 1, \ldots, l$, we choose independently and uniformly at random a
c-node $a = a_{(i-1)l+j} \in C$ and add the couple $(i,a)$ to the list $E$.
An alternative way to sample from the same distribution is by first
attributing $l$ sockets to each v-node, with sockets
$(i-1)l + 1, \ldots, il$ attributed to the $i$th v-node. Then, we attribute
$k_a$ sockets to each c-node $a$, where the $k_a$'s are mutually
independent Poisson($\zeta$) random variables, conditioned upon their sum
being $nl$ (these sockets are ordered using any pre-established
convention). Finally, we connect the v-node sockets to the c-node sockets
according to a permutation $\sigma$ of $\{1, \ldots, nl\}$ that is chosen
uniformly at random and independently of the choice of the $k_a$'s.

Throughout, the degree of a v-node $i$ (or c-node $a$) will refer to the
number of edges $(i,b)$ [resp. $(j,a)$] it belongs to. In the hypergraph
description, this corresponds to counting hyperedges, and vertices with
their multiplicity.
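
The two sampling procedures just described can be sketched as follows. This
is an illustration only: the function names are hypothetical, c-nodes are
labeled $0, \ldots, m-1$, and the socket-based version uses the standard
fact that independent Poisson variables with a common mean, conditioned on
their sum, are multinomially distributed with equal cell probabilities (so
the Poisson parameter never needs to be specified).

```python
import random


def sample_direct(n, m, l, rng):
    """For each v-node, draw its l c-nodes independently and uniformly."""
    return [[rng.randrange(m) for _ in range(l)] for _ in range(n)]


def sample_sockets(n, m, l, rng):
    """Socket construction: c-node degrees, then a uniform socket matching."""
    # Degrees k_a: Poisson variables conditioned on summing to n*l,
    # sampled here as a multinomial by throwing n*l balls into m cells.
    degrees = [0] * m
    for _ in range(n * l):
        degrees[rng.randrange(m)] += 1
    # One entry per c-node socket, then a uniformly random permutation.
    c_sockets = [a for a in range(m) for _ in range(degrees[a])]
    rng.shuffle(c_sockets)
    # v-node i owns the sockets with positions i*l, ..., (i+1)*l - 1.
    return [c_sockets[i * l:(i + 1) * l] for i in range(n)]
```

Both functions return a list of $n$ ordered lists of length $l$, which is
the edge list $E$ grouped by v-node.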
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3.2. Model for the graph produced by the algorithm. The ensemble is 
characterized by the nonnegative integers (^1,^2) = z, t and I > 3, n,m 
and denoted ^ as Q{z,t). In order for Q{z,t) to be nonempty, we require 
either ^2 > 1 and zi + 2z2 < {n — t)1 or ^2 = and zi = {n — t)1. An ele- 
ment in the ensemble is a graph G= {U,V; R, S,T; E) where U,V are dis- 
joint subsets of [n] with U U V = [n] and R,S,T are disjoint subsets of 
[m] with RU S U T = [m], having the cardinalities |C/|=r, \V\ = n — t, 
\R\ = m — zi — Z2, \S\ = zi, \T\ = Z2- Finally, E is an ordered list of (n — t)1 
edges 



such that a couple {i, a) appears before (j, b) whenever i < j. Moreover, each 
i GV appears as the first coordinate of exactly / edges in E, while each j €U 
does not appear in any of the couples in E. Similarly, each a £ R does not 
appear in E, each b £ S appears as the second coordinate of exactly one edge 
in E, and each c£T appears in at least two such edges. The total number 
of elements in Q{z,t) is thus 



The ensemble Q(z,t) is endowed with the uniform distribution. In order 
to sample from it, first partition [n] into U and V uniformly at random 
under the constraints |C/| = r and \ V\ = {n — t) [there are (") ways of doing 
this], and independently partition [m] to R L) S U T uniformly at random 
under the constraints =m — zi — Z2, \S\ = z\ and |T| = 2:2 [of which there 
are (^^^j .) possibilities]. Then, attribute I v-sockets to each i£V and num- 
ber them from 1 to (n — r)/ according to some pre-established convention. 
Attribute one c-socket to each a G S and ka c-sockets to each a €T, where 
ka are mutually independent Poisson((^) random variables conditioned upon 
ka > 2, and further conditioned upon J2aeT^a being (n — t)1 — z\. Finally, 
connect the v-sockets and c-sockets according to a uniformly random per- 
mutation on (n — t)1 objects, chosen independently of the ka's,. 

3.3. Transition probabilities. We consider the graph process {G(r),r > 
0}, defined as follows. The initial graph G(0) is a uniformly random element 
of Qi{n,m). At each time r = 0, 1, . . . , if there is a nonempty set of c-nodes 
of degree 1, one of them (let us say a) is chosen uniformly at random. The 
corresponding edge {i, a) is deleted, together with all the edges incident to 



^ Since n, m and I do not vary during the execution of the algorithm, we leave them 
implicit in the ensemble notation. 



E = [(ii,ai), . . . , (ii,a;); (z2, a/+i), . . . ; 

{in~Ti (^(n— T~ 1)1+1) 1 ■ • ■ ) i}n—Ti (^{n 
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the v-node i. The graph thus obtained is G{t + 1). In the opposite case, 
where there are no c- nodes of degree 1 in G(r), we set G{t + 1) = G(r). 
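
The graph process just described can be simulated directly. The following is a minimal sketch, not the authors' code: the function names `sample_graph` and `leaf_removal` are hypothetical, the initial graph is drawn by letting every v-node pick its $l$ c-nodes independently and uniformly, and the loop stops at the first time no c-node of degree 1 remains (after which the process is frozen).

```python
import random
from collections import defaultdict


def sample_graph(n, m, l, rng):
    """Edge list E: each v-node i picks l c-nodes independently and uniformly."""
    return [(i, rng.randrange(m)) for i in range(n) for _ in range(l)]


def leaf_removal(n, m, l, rng):
    """Run G(0), G(1), ... and return the trajectory of (z1, z2)."""
    edges = sample_graph(n, m, l, rng)
    v_adj = defaultdict(list)   # v-node -> its c-nodes, with multiplicity
    c_adj = defaultdict(list)   # c-node -> its v-nodes, with multiplicity
    deg = [0] * m               # current degree of every c-node
    for i, a in edges:
        v_adj[i].append(a)
        c_adj[a].append(i)
        deg[a] += 1
    alive = [True] * n          # v-nodes not yet deleted

    def counts():
        z1 = sum(1 for d in deg if d == 1)
        z2 = sum(1 for d in deg if d >= 2)
        return z1, z2

    trajectory = [counts()]
    while True:
        leaves = [a for a in range(m) if deg[a] == 1]
        if not leaves:
            break               # no c-node of degree 1: the process is frozen
        a = rng.choice(leaves)  # uniformly chosen c-node of degree 1
        i = next(v for v in c_adj[a] if alive[v])  # its unique surviving v-node
        alive[i] = False
        for b in v_adj[i]:      # delete all l edges incident to i
            deg[b] -= 1
        trajectory.append(counts())
    return trajectory
```

Each step deletes exactly one v-node, so after $\tau$ steps the surviving edges number $(n-\tau)l$, which is why $z_1 + 2z_2 \le (n-\tau)l$ along the whole trajectory. The run ends with a core exactly when the final $z_2$ is positive.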

We define furthermore the process $\{z(\tau) = (z_1(\tau),z_2(\tau)),\tau \ge 0\}$ on $\mathbb{Z}_+^2$.
Here $z_1(\tau)$ and $z_2(\tau)$ are, respectively, the number of c-nodes in $G(\tau)$ having
degree 1 or larger than 1, which necessarily satisfy $(n-\hat\tau)l \ge z_1(\tau) + 2z_2(\tau)$
for $\hat\tau \equiv \min(\tau,\inf\{\tau' \ge 0 : z_1(\tau') = 0\})$.

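Lemma 3.1. The process $\{z(\tau),\tau \ge 0\}$ is an inhomogeneous Markov process,
whose transition probabilities, denoted by

$$W_\tau^+(\Delta z|z) \equiv \mathbb{P}\{z(\tau+1) = z + \Delta z \,|\, z(\tau) = z\}$$

[here $\Delta z \equiv (\Delta z_1,\Delta z_2)$], are such that $W_\tau^+(\Delta z|z) = \mathbb{I}(\Delta z = 0)$ in case
$z_1 = 0$, whereas for $z_1 > 0$,

$$W_\tau^+(\Delta z|z) = \frac{h(z',\tau+1)}{h(z,\tau)}\,(\tau+1)\,l!\sum_{\mathcal{D}}\binom{m-z_1'-z_2'}{q_0,p_0,\cdot}\binom{z_1'}{q_1}\binom{z_2'}{q_2}\frac{q_0}{z_1}\;\mathrm{coeff}[(e^x-1-x)^{p_0}(e^x-1)^{q_1+q_2},\,x^{l-q_0}]. \tag{3.2}$$

Here $z_1' = z_1 + \Delta z_1$, $z_2' = z_2 + \Delta z_2$. Also, using the notation $z_0 = m - z_1 - z_2$
and $z_0' = m - z_1' - z_2'$, the collection $\mathcal{D}$ consists of all integers
$p_0,q_0,q_1,q_2 \ge 0$ satisfying the equalities

$$\begin{cases} z_0 = z_0' - q_0 - p_0,\\ z_1 = z_1' + q_0 - q_1,\\ z_2 = z_2' + p_0 + q_1, \end{cases} \tag{3.3}$$

and the inequalities $(n-\tau)l - (z_1+2z_2) \ge l - (2p_0+q_0+q_1) \ge q_2$, $q_0 + p_0 \le z_0'$,
$q_1 \le z_1'$ (equivalently, $q_0 \le z_1$), $q_2 \le z_2'$ (equivalently, $p_0+q_1+q_2 \le z_2$).

Moreover, conditional on $\{z(\tau'),0 \le \tau' \le \tau\}$, the graph $G(\tau)$ is uniformly
distributed over $\mathcal{G}(z,\tau)$, that is,

$$\mathbb{P}\{G(\tau) = G \,|\, \{z(\tau'),0 \le \tau' \le \tau\}\} = \frac{1}{h(z,\tau)}\,\mathbb{I}(G \in \mathcal{G}(z,\tau)). \tag{3.4}$$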

Proof. Fixing $\tau$, $z = z(\tau)$ such that $z_1 > 0$, $z' = z(\tau+1)$ and
$G' \in \mathcal{G}(z',\tau+1)$, let $N(G'|z,\tau)$ count the pairs of graphs $G \in \mathcal{G}(z,\tau)$ and
choices of the deleted c-node from $S$ that result with $G'$ upon applying a
single step of our algorithm. Obviously, $G$ and $G'$ must be such that
$R \subset R'$, $S \subset R' \cup S'$ and $T' \subset T$. So, let $q_0 \ge 0$ denote the size of $R' \cap S$,
$p_0 \ge 0$ the size of $R' \cap T$, and $q_1 \ge 0$ the size of $S' \cap T$. We have
$q_0 + p_0 \le m - z_1' - z_2'$, $q_1 \le z_1'$, and the equalities of (3.3) follow as well.
Let $T^*$ denote the set of c-nodes $a \in T'$ for which $k_a > k_a'$, and denote the
size of $T^*$ by $q_2 \le z_2'$. Observe that of the $l$ edges of the v-node $i$ deleted
by the algorithm in the move from $G$ to $G'$, exactly one edge hits each of the
nodes in $R' \cap S$, at least one edge hits each of the nodes in $S' \cap T$ and each
of the nodes in $T^*$, while at least two edges hit each of the nodes in
$R' \cap T$. Consequently, $2p_0 + q_0 + q_1 + q_2 \le l$. Since $z_1 > 0$, we know that
$\hat\tau = \tau$ and further, $(n-\tau-1)l \ge z_1' + 2z_2'$, which in view of (3.3) is
equivalent to $(n-\tau)l - (z_1+2z_2) \ge l - (2p_0+q_0+q_1) \ge q_2$ as claimed.

To count $N(G'|z,\tau)$ we first select the v-node $i$ to add to $G'$ from among
the $\tau+1$ elements of $U'$, and the order (permutation) of the $l$ sockets of
$i$ that we use when connecting it to the c-nodes for creating $G \in \mathcal{G}(z,\tau)$.
Summing over the set $\mathcal{D}$ of allowed values of $p_0,q_0,q_1,q_2$, for each such
value we have $\binom{m-z_1'-z_2'}{q_0,p_0,\cdot}$ ways to select the nodes of $R'$ that are assigned to
$S$, $T$ and $R$, then $\binom{z_1'}{q_1}$ ways to select those of $S'$ that are assigned to $T$
and $\binom{z_2'}{q_2}$ ways to select those of $T'$ that are assigned to $T^*$. We further have
$\mathrm{coeff}[(e^x-1-x)^{p_0}(e^x-1)^{q_1+q_2},x^{l-q_0}]$ ways to select the precise number of
edges ($\ge 2$) from $i$ that we are to connect to each of the $p_0$ nodes in
$R' \cap T$, and the precise number of edges ($\ge 1$) from $i$ that we are to connect
to each of the $q_1$ nodes in $S' \cap T$ and to each of the $q_2$ nodes in $T^*$, while
allocating in this manner exactly $l - q_0$ edges out of $i$ (the remaining $q_0$
then used to connect to nodes in $R' \cap S$). Finally, noting that for each of the
graphs $G$ thus created we have exactly $q_0$ ways to choose the deleted node
from $S$ while still resulting with the graph $G'$, we conclude that

$$N(G'|z,\tau) = (\tau+1)\,l!\sum_{\mathcal{D}}\binom{m-z_1'-z_2'}{q_0,p_0,\cdot}\binom{z_1'}{q_1}\binom{z_2'}{q_2}\; q_0\,\mathrm{coeff}[(e^x-1-x)^{p_0}(e^x-1)^{q_1+q_2},\,x^{l-q_0}].$$

We start at $\tau = 0$ with a uniform distribution of $G(0)$ within each possible
ensemble $\mathcal{G}(z(0),0)$. Since $N(G'|z,\tau)$ depends on $G'$ only via $z'$, it follows
by induction on $\tau = 1,2,\dots$ that this property, namely (3.4), is preserved as
long as $\hat\tau = \tau$, since if $z_1(\tau) > 0$, then

$$\mathbb{P}\{G(\tau+1) = G' \,|\, \{z(\tau'),0 \le \tau' \le \tau\}\} = \frac{1}{z_1}\,\frac{N(G'|z(\tau),\tau)}{h(z(\tau),\tau)}$$

is the same for all $G' \in \mathcal{G}(z+\Delta z,\tau+1)$. Since there are exactly
$h(z+\Delta z,\tau+1)$ graphs in this ensemble, we thus recover also (3.2). Finally,
noting that $G(\tau) = G(\hat\tau)$ and $z(\tau) = z(\hat\tau)$ we deduce that (3.4) holds also when
$\hat\tau < \tau$.
□
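
The coefficient extraction $\mathrm{coeff}[\cdot,x^s]$ used in the counting argument can be checked on small cases. The sketch below is an illustration under stated assumptions rather than part of the proof: the helper names `coeff` and `brute_force` are hypothetical, and the check uses the standard fact that $s!\,\mathrm{coeff}[(e^x-1-x)^{p_0}(e^x-1)^{q},x^s]$ counts the ways to distribute $s$ labelled sockets over $p_0+q$ nodes so that the first $p_0$ nodes receive at least two sockets and the remaining $q$ at least one.

```python
from fractions import Fraction
from itertools import product
from math import factorial


def exp_tail(k_min, order):
    """Coefficients of sum_{k >= k_min} x^k / k!, truncated at degree `order`."""
    return [Fraction(1, factorial(k)) if k >= k_min else Fraction(0)
            for k in range(order + 1)]


def mul(a, b, order):
    """Product of two truncated power series."""
    out = [Fraction(0)] * (order + 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            if i + j > order:
                break
            out[i + j] += ai * bj
    return out


def coeff(p0, q, s):
    """coeff[(e^x - 1 - x)^p0 (e^x - 1)^q, x^s] as an exact fraction."""
    series = [Fraction(1)] + [Fraction(0)] * s
    for _ in range(p0):
        series = mul(series, exp_tail(2, s), s)
    for _ in range(q):
        series = mul(series, exp_tail(1, s), s)
    return series[s]


def brute_force(p0, q, s):
    """Count socket-to-node assignments with the required minimal loads."""
    count = 0
    for assignment in product(range(p0 + q), repeat=s):
        loads = [assignment.count(node) for node in range(p0 + q)]
        if all(x >= 2 for x in loads[:p0]) and all(x >= 1 for x in loads[p0:]):
            count += 1
    return count
```

For instance, `factorial(3) * coeff(0, 2, 3)` equals 6, the number of surjections from three sockets onto two nodes.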



4. Asymptotic expressions. 



4.1. Properties of the ordinary differential equations. We derive here the
properties of solutions of the ODEs (2.5) and (2.8) that are needed for our
analysis. This is based on the continuity of $(x,\theta) \mapsto \mathfrak{p}_a(x,\theta)$, $a = 0,1,2$ on
the following compact subsets of $\mathbb{R}^2 \times \mathbb{R}_+$:

$$q(\epsilon) \equiv \{(x,\theta) : -l \le x_1;\ 0 \le x_2;\ \theta \in [0,1-\epsilon];\ 0 \le (1-\theta)l - \max(x_1,0) - 2x_2\},$$

and $q_+(\epsilon) \equiv q(\epsilon) \cap \{x_1 \ge 0\}$, as stated in

Lemma 4.1. For any $\epsilon > 0$, the functions $(x,\theta) \mapsto \mathfrak{p}_a(x,\theta)$, $a = 0,1,2$ are
$[0,1]$-valued, Lipschitz continuous on $q(\epsilon)$. Further, on $q_+(\epsilon)$ the functions
$(x,\theta) \mapsto \mathfrak{p}_a(x,\theta)$ have Lipschitz continuous partial derivatives.

Proof. Fixing $\epsilon > 0$, the stated Lipschitz continuity holds for $\mathfrak{p}_0(x,\theta)$
since both $\max(x_1,0)$ and $1/(1-\theta)$ are Lipschitz continuous and bounded
on $q(\epsilon)$. Further, $\mathfrak{p}_0(x,\theta) \in [0,1]$ throughout $q(\epsilon)$. Setting $f_1(0) = 2$, note
that $f_1 : \mathbb{R}_+ \to [2,\infty)$ of (2.3) is a monotone increasing, twice continuously
differentiable function, with $f_1'(\lambda) = [(e^\lambda-1)^2 - \lambda^2 e^\lambda]/(e^\lambda-1-\lambda)^2$ strictly
positive and bounded away from zero throughout $\mathbb{R}_+$. Consequently, the
inverse mapping $f_1^{-1}$ is well defined and twice continuously differentiable
on $[2,\infty)$, from which we deduce that for each $\delta > 0$ the nonnegative function
$\lambda(x,\theta)$ is well defined, bounded and continuously differentiable on the
compact set $q(\epsilon) \cap \{(x,\theta) : x_2 \ge \delta\}$. Though $\lambda(x,\theta) \uparrow \infty$ as $x_2 \downarrow 0$, note that
$\mathfrak{p}_2 = (1-\mathfrak{p}_0)(1-g(\lambda))$ for $g(\lambda) = \lambda/(e^\lambda-1)$. In particular, since $\mathfrak{p}_2 = 1-\mathfrak{p}_0$
in case $x_2 = 0$, it follows that $\mathfrak{p}_2(x,\theta)$ is continuous throughout $q(\epsilon)$. Since
$\mathfrak{p}_0(x,\theta)$ is Lipschitz continuous on $q(\epsilon)$, the Lipschitz continuity of $\mathfrak{p}_2$ follows
by showing that, for $x_1 \ne 0$, $g(\lambda(x,\theta))$ has bounded derivatives as $x_2 \downarrow 0$. By
letting $\xi = (x,\theta) \in q(\epsilon)$, we have $\partial_{\xi_i} g(\lambda) = g'(\lambda)\,\partial_{\xi_i}\lambda$. Using the definition
(2.3), and recalling that $f_1'(\lambda)$ is bounded away from zero, it follows that
$|\partial_{\xi_i}\lambda| \le C x_2^{-2}$ as $x_2 \downarrow 0$. On the other hand, $|g'(\lambda)| \le C e^{-\lambda/2} \le C e^{-c/x_2}$ in
the same limit, thus implying that $\partial_{\xi_i} g(\lambda)$ is bounded.

Further, the identity (2.3) is equivalent to $\mathfrak{p}_0 + \mathfrak{p}_1 + \mathfrak{p}_2 = 1$, which thus
implies that $\mathfrak{p}_1$ is also Lipschitz continuous on $q(\epsilon)$. Finally, since both
$\lambda(x,\theta)$ and $x_2$ are nonnegative throughout $q(\epsilon)$, the same applies for $\mathfrak{p}_1$ and
$\mathfrak{p}_2$, and consequently, $\mathfrak{p}_a \in [0,1]$ for $a = 0,1,2$.

Considering for the remainder of the proof $\xi = (x,\theta) \in q_+(\epsilon)$, we replace
$\max(x_1,0)$ by $x_1$ in the definition of $(\mathfrak{p}_0,\mathfrak{p}_1,\mathfrak{p}_2)$. The stated regularity of $\mathfrak{p}_0$
is then obvious and as before the regularity of $\mathfrak{p}_1 = 1 - \mathfrak{p}_0 - \mathfrak{p}_2$ follows from
that of $\mathfrak{p}_2 = (1-\mathfrak{p}_0)(1-g(\lambda))$. To this end, we see that it suffices to show
that $\partial_{\xi_i} g(\lambda) = g'(\lambda)\,\partial_{\xi_i}\lambda$ are Lipschitz continuous in $\xi$ on the compact set
$q_+(\epsilon)$. As seen already, $\lambda \mapsto g'(\lambda)$ is bounded and Lipschitz continuous on $\mathbb{R}_+$,
and $\partial_{\xi_i}\lambda(\xi)$ is bounded and has bounded derivatives on $q_+(\epsilon) \cap \{x_2 \ge \delta\}$.
The proof is completed by showing that
$\partial_{\xi_j}[g'(\lambda)\,\partial_{\xi_i}\lambda] = g''(\lambda)\,\partial_{\xi_i}\lambda\,\partial_{\xi_j}\lambda + g'(\lambda)\,\partial_{\xi_i}\partial_{\xi_j}\lambda$
converges to zero as $x_2 \to 0$. This is proved similarly to what
was already done for the first-order derivatives. Indeed, the first and second
derivatives of $\lambda \mapsto f_1(\lambda)$ as well as $\xi \mapsto \partial_{\xi_i}[x_2 f_1(\lambda)]$ and its partial derivatives
are all bounded, hence $|\partial_{\xi_i}\lambda| \le C x_2^{-2}$ and $|\partial_{\xi_i}\partial_{\xi_j}\lambda| \le C x_2^{-3}$ as $x_2 \downarrow 0$, which
since $\lambda \to \infty$ inversely proportional to $x_2 \to 0$, is more than compensated by
the exponential decay in $\lambda$ of $g'$ and $g''$. □

Setting $h_\rho(u) \equiv u - 1 + \exp(-l u^{l-1}/\rho)$ and the finite and positive critical
density

$$\rho_c \equiv \inf\{\rho > 0 : h_\rho(u) > 0\ \ \forall u \in (0,1]\},$$

we have the following properties of the ODEs.
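
The critical density can be evaluated numerically from its definition. The sketch below is only illustrative; the function name `critical_density` and the grid search are assumptions of this example. It uses the elementary observation that, for $u \in (0,1)$, $h_\rho(u) > 0$ is equivalent to $\rho > l u^{l-1}/(-\log(1-u))$, while $h_\rho(1) > 0$ always holds, so that $\rho_c$ is the supremum of $l u^{l-1}/(-\log(1-u))$ over $u \in (0,1)$.

```python
import math


def h(rho, u, l):
    """h_rho(u) = u - 1 + exp(-l u^(l-1) / rho)."""
    return u - 1.0 + math.exp(-l * u ** (l - 1) / rho)


def critical_density(l, grid=200_000):
    """Approximate rho_c = sup_{0<u<1} l u^(l-1) / (-log(1-u)) on a uniform grid."""
    best = 0.0
    for k in range(1, grid):
        u = k / grid
        best = max(best, l * u ** (l - 1) / (-math.log1p(-u)))
    return best
```

For $l = 3$ this gives a value close to 1.2218. Slightly above it $h_\rho$ stays positive on $(0,1]$, and slightly below it $h_\rho$ becomes negative near its interior local minimum.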

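Proposition 4.2. For any $\rho > 0$, the ODE (2.5) admits a unique solution
$y$ subject to the initial conditions (2.4), and the ODE (2.8) admits a
unique, positive definite, solution $\mathbb{Q}$ subject to the initial conditions (2.10),
such that:

(a) For any $\epsilon > 0$, $\theta < 1-\epsilon$, we have that $(y(\theta,\rho),\theta)$ is in the interior
of $q(\epsilon)$, with both functions $(\theta,\rho) \mapsto y$ and $(\theta,\rho) \mapsto \mathbb{Q}$ Lipschitz continuous on
$(\theta,\rho) \in [0,1-\epsilon) \times [\epsilon,1/\epsilon]$.

(b) Let $u(\theta) \equiv (1-\theta)^{1/l}$ and $\theta_-(\rho) \equiv \inf\{\theta \ge 0 : h_\rho(u(\theta)) < 0\} \wedge 1$. Then,
for $\theta \in [0,\theta_-(\rho)]$,

$$y_1(\theta,\rho) = l\,u(\theta)^{l-1}\big[u(\theta) - 1 + e^{-\gamma u(\theta)^{l-1}}\big], \tag{4.1}$$

$$y_2(\theta,\rho) = \frac{l}{\gamma}\big[1 - e^{-\gamma u(\theta)^{l-1}} - \gamma u(\theta)^{l-1} e^{-\gamma u(\theta)^{l-1}}\big] \tag{4.2}$$

(where $\gamma = l/\rho$). In particular, $(\theta,\rho) \mapsto y$ is infinitely continuously
differentiable and $(\theta,\rho) \mapsto \mathbb{Q}$ is Lipschitz continuous on
$\{(\theta,\rho) : \theta \le \min(\theta_-(\rho),1-\epsilon),\ \epsilon \le \rho \le 1/\epsilon\}$.

(c) Let $\theta_*(\rho) \equiv \inf\{\theta \ge 0 : h_\rho(u(\theta)) \le 0\}$. Then,
$\theta_*(\rho) = \sup\{\theta \le 1 : y_1(\theta',\rho) > 0 \text{ for all } \theta' \in [0,\theta)\}$ and the critical density is
such that

$$\rho_c = \inf\{\rho > 0 : \theta_*(\rho) = 1\} = \inf\{\rho > 0 : y_1(\theta,\rho) > 0\ \ \forall\theta \in [0,1)\}. \tag{4.3}$$

(d) The critical time $\theta_c \equiv \theta_*(\rho_c)$ is in $(0,1)$, whereas $\theta_-(\rho_c) = 1$. For
$\rho = \rho_c$ the infinitely continuously differentiable function $\theta \mapsto y_1(\theta)$ is positive
for $\theta \ne \theta_c$, $\theta \ne 1$, with $y_1(\theta_c) = y_1'(\theta_c) = 0$, and $y_1''(\theta_c) > 0$, while
$y_1(1-\delta) = l\delta + o(\delta)$ for any $\delta > 0$.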
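
The closed form of part (b) lends itself to a quick numerical sanity check. The sketch below is an illustration and not part of the proposition: it assumes the drift along the solution takes the form $F_1 = -1 - \frac{l-1}{u}\big(u - 1 + e^{-\lambda} - \lambda e^{-\lambda}\big)$ and $F_2 = -\gamma(l-1)u^{l-2}e^{-\lambda}$ with $\lambda = \gamma u^{l-1}$, as obtained in the proof of part (b), and compares it with a central finite difference of (4.1) and (4.2). The function names are hypothetical.

```python
import math


def y_closed(theta, rho, l):
    """Closed-form solution (4.1)-(4.2), with gamma = l / rho."""
    u = (1.0 - theta) ** (1.0 / l)
    gamma = l / rho
    e = math.exp(-gamma * u ** (l - 1))
    y1 = l * u ** (l - 1) * (u - 1.0 + e)
    y2 = (l / gamma) * (1.0 - e - gamma * u ** (l - 1) * e)
    return y1, y2


def drift_on_solution(theta, rho, l):
    """Assumed form of F(y(theta), theta), using lambda = gamma * u^(l-1)."""
    u = (1.0 - theta) ** (1.0 / l)
    gamma = l / rho
    lam = gamma * u ** (l - 1)
    e = math.exp(-lam)
    f1 = -1.0 - (l - 1) / u * (u - 1.0 + e - lam * e)
    f2 = -gamma * (l - 1) * u ** (l - 2) * e
    return f1, f2


def finite_difference(theta, rho, l, eps=1e-6):
    """Central difference approximation of dy/dtheta for the closed form."""
    hi = y_closed(theta + eps, rho, l)
    lo = y_closed(theta - eps, rho, l)
    return tuple((a - b) / (2 * eps) for a, b in zip(hi, lo))
```

At $\theta = 0$ the closed form reduces to $y(0) = \rho\,(\gamma e^{-\gamma},\,1 - e^{-\gamma} - \gamma e^{-\gamma})$, in line with the initial condition used in Lemma 4.4.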

Proof. (a) For any $\rho > 0$ the initial condition $y(0)$ of (2.4) is such that
$(y(0),0)$ is in the interior of $q(\epsilon)$. Further, fixing $\epsilon > 0$, by Lemma 4.1 we have
that $F(x,\theta)$ is bounded and Lipschitz continuous on $q(\epsilon)$. Consequently, for
$\theta \in [0,\theta_\epsilon]$ there exists a unique solution $y(\theta)$ of the ODE (2.5) [i.e.,
$dy/d\theta = F(y,\theta)$], starting at this initial condition, where
$\theta_\epsilon \equiv \inf\{\theta \ge 0 : (y(\theta),\theta) \notin q(\epsilon)\}$ is strictly positive, and $(y(\theta_\epsilon),\theta_\epsilon)$ is
necessarily on the boundary of $q(\epsilon)$. We proceed to verify that $\theta_\epsilon = 1-\epsilon$ by
showing that:

(i) $y_1(\theta_\epsilon) > -l$. Indeed, since $y_1(0) > 0$ and $F_1(y(\theta),\theta) \ge -l$, we have
$y_1(\theta_\epsilon) > -l\theta_\epsilon > -l$.

(ii) $y_2(\theta_\epsilon) > 0$. In fact $x_2 = 0$ implies $\mathfrak{p}_1(x,\theta) = 0$, and therefore
$F_2(x,\theta) = 0$. By the Lipschitz continuity of $F_2$ on $q(\epsilon)$ it follows that
$F_2(x,\theta) \ge -Cx_2$ for some finite $C$ and all $x_2$ in $q(\epsilon)$. Therefore,
$y_2(\theta_\epsilon) \ge y_2(0)e^{-C\theta_\epsilon} > 0$.

(iii) $v(\theta_\epsilon) > 0$, where $v(\theta) \equiv w(y(\theta),\theta)$ for
$w(x,\theta) \equiv l(1-\theta) - \max(x_1,0) - 2x_2$. Indeed, note that $v(0) > 0$ and

$$\frac{dv}{d\theta} = [-l + 2(l-1)\mathfrak{p}_1(y,\theta)]\,\mathbb{I}(y_1(\theta) < 0) - (l-1)\mathfrak{p}_2(y,\theta)\,\mathbb{I}(y_1(\theta) \ge 0).$$

Further, recall that if $w(x,\theta) = 0$, then $\mathfrak{p}_2(x,\theta) = 0$, and if in addition $x_1 \le 0$,
then also $\mathfrak{p}_1(x,\theta) = 1$. Hence, by the Lipschitz continuity of $\mathfrak{p}_1(\cdot,\cdot)$ and $\mathfrak{p}_2(\cdot,\cdot)$
on $q(\epsilon)$ we have that $\mathfrak{p}_2(x,\theta) \le 2Cw(x,\theta)$ and $\mathfrak{p}_1(x,\theta) \ge (1 - Cw(x,\theta))\,\mathbb{I}(x_1 \le 0)$
for some finite $C > 0$, throughout $q(\epsilon)$. Since $l \ge 2$, it follows that
$dv/d\theta \ge -2(l-1)Cv(\theta)$ for all $\theta \in [0,\theta_\epsilon]$, resulting with
$v(\theta_\epsilon) \ge v(0)e^{-2(l-1)C\theta_\epsilon} > 0$.

Lemma 4.1 further implies that for any $a,b \in \{1,2\}$ both
$\mathbb{A}_{ab}(x,\theta) = \partial_{x_b}F_a(x,\theta)$ and $\mathbb{G}_{ab}(x,\theta)$ are uniformly bounded over $q(\epsilon)$. The linear
ODE (2.8) has these functions as its coefficients, for $x = y(\theta)$. We thus deduce
that there exists a unique solution $\mathbb{Q}(\theta)$ of the initial value problem for this
ODE at least for $\theta \in [0,\theta_\epsilon]$. With $\theta_\epsilon = 1-\epsilon$ and $\epsilon > 0$ arbitrarily small we
established the existence of a unique solution $(y,\mathbb{Q})$ for $\theta \in [0,1)$.

It also follows from the above discussion that $\theta_\epsilon = 1-\epsilon$ and $y(\theta,\rho)$ is
Lipschitz continuous in $\theta$ on $[0,1-\epsilon] \times [\epsilon,1/\epsilon]$. Further, applying Gronwall's
lemma, the Lipschitz continuity of $F(x,\theta)$ implies that the solution $y(\theta)$
of the ODE is then also Lipschitz continuous with respect to the initial
condition $y(0)$, with a uniform in $\theta \le 1-\epsilon$ bound on the corresponding
Lipschitz norm. Clearly, $y(0)$ of (2.4) is differentiable in $\rho$ with a uniformly
bounded derivative when $\rho \in [\epsilon,1/\epsilon]$. Consequently, we arrive at the stated
Lipschitz continuity of $(\theta,\rho) \mapsto y(\theta,\rho)$.

The same argument shows that the initial conditions (2.10) for the ODE
(2.8) are bounded in $\rho \in [\epsilon,1/\epsilon]$. Further, $y(\theta,\rho)$ stays in $q(\epsilon)$ and with the
coefficients of the linear ODE (2.8) uniformly bounded on $[0,1-\epsilon] \times [\epsilon,1/\epsilon]$,
its solution $\mathbb{Q}$ is also Lipschitz continuous in $\theta$. Suppressing the dependence
of the various matrices on $\rho$, set $\mathbb{B}_\zeta^\zeta = \mathbb{I}$ and, for $\theta > \zeta$,

$$\frac{d\mathbb{B}_\zeta^\theta}{d\theta} = \mathbb{A}(y(\theta),\theta)\,\mathbb{B}_\zeta^\theta. \tag{4.4}$$

It is easy to check that the unique solution of (2.8) is given by

$$\mathbb{Q}(\theta) = \mathbb{B}_0^\theta\,\mathbb{Q}(0)\,(\mathbb{B}_0^\theta)^\dagger + \int_0^\theta \mathbb{B}_\zeta^\theta\,\mathbb{G}(y(\zeta),\zeta)\,(\mathbb{B}_\zeta^\theta)^\dagger\,d\zeta, \tag{4.5}$$

for the nonnegative definite matrix $\mathbb{G}(x,\theta)$ of (2.9). In particular, starting
from the symmetric, positive definite $\mathbb{Q}(0)$ of (2.10), this implies that $\mathbb{Q}(\theta)$
is nonnegative definite. Further, since $\det\mathbb{B}_0^0 = 1$ and

$$\frac{d(\det\mathbb{B}_0^\theta)}{d\theta} = (\det\mathbb{B}_0^\theta)\,\mathrm{Trace}(\mathbb{A}(y(\theta),\theta))$$

with the entries of $\mathbb{A}(x,\theta)$ uniformly bounded, it follows that $\det\mathbb{B}_0^\theta > 0$,
hence the solution $\mathbb{Q}(\theta)$ of (2.8) is positive definite.

(b) Though this is a special case of a result of [26], we provide its short
proof for the reader's convenience. We first check that $y(\theta,\rho)$ of (4.1) and
(4.2) is the unique solution of the ODE (2.5) for $\theta \in [0,\theta_-(\rho)]$. To this end,
first note that for $\theta = 0$ the functions $y(\theta,\rho)$ of (4.1) and (4.2) satisfy the
initial condition (2.4). Further, the function $y_1(\theta,\rho)$ of (4.1) is nonnegative for
$\theta \in [0,\theta_-(\rho)]$. Hence, upon substituting $y_1(\theta,\rho)$ for $\max(x_1,0)$ and $y_2(\theta,\rho)$
for $x_2$ on the right-hand side of (2.3), and noticing that $(1-\theta) = u(\theta)^l$, it is
not hard to verify that this equation is satisfied by $\lambda(y(\theta),\theta) = \gamma u(\theta)^{l-1}$.
Using this value of $\lambda$ yields after some algebra that

$$F_1(y(\theta),\theta) = -1 - \frac{l-1}{u}\big(u - 1 + e^{-\gamma u^{l-1}} - \gamma u^{l-1}e^{-\gamma u^{l-1}}\big), \qquad F_2(y(\theta),\theta) = -\gamma(l-1)u^{l-2}e^{-\gamma u^{l-1}}.$$

With $du/d\theta = -u^{1-l}/l$, it is then immediate to verify that the functions given by
(4.1) and (4.2) indeed satisfy (2.5) as long as $\theta \le \theta_-(\rho)$. Clearly, $y(\theta,\rho)$ of
(4.1) and (4.2) is infinitely continuously differentiable on $[0,1-\epsilon] \times [\epsilon,1/\epsilon]$.
With $\mathbb{Q}(\theta,\rho)$ Lipschitz continuous in $\theta$ [by (a)], it remains only to show
that this function is Lipschitz continuous with respect to $\rho \in [\epsilon,1/\epsilon]$. Since
the ODE (2.8) is linear and of bounded coefficients, with initial condition
$\mathbb{Q}(0)$ of (2.10) that is Lipschitz continuous in $\rho \in [\epsilon,1/\epsilon]$, it suffices to show
that the coefficients $\mathbb{A}_{ab}(x,\theta)$ and $\mathbb{G}_{ab}(x,\theta)$ are Lipschitz continuous in $x$
on $q_+(\epsilon)$. We deduce the latter property from Lemma 4.1 upon noting that
these coefficients are smooth bounded functions of $\mathfrak{p}_a$ and $\partial_{x_b}\mathfrak{p}_a$.

(c) We turn to verify that $\rho_c$ satisfies (4.3). We have already seen that the
solution of (2.5) starting at $y(0)$ of (2.4) is given for $\theta \le \theta_*(\rho) \le \theta_-(\rho)$ by
(4.1) and (4.2), and in particular is such that $y_1(\theta,\rho) > 0$ for all $\theta < \theta_*(\rho)$.
Further, $\rho \mapsto \theta_*(\rho)$ is monotone nondecreasing, and since $u(1) = 0$, we see
that $\theta_*(\rho) \le 1$ for all $\rho > 0$. Thus, to complete the proof it suffices to assume
that for some positive $\delta$ and $\rho_0$ the solution of the ODE (2.5) is such that
$y_1(\theta,\rho_0) > 0$ for all $\theta \in [0,\theta_*(\rho_0)+\delta]$ and arrive at a contradiction. To this
end, note that for $\rho = \rho_0$ and $\theta \le \theta_*(\rho_0)+\delta$, the solution of (2.5) must also
satisfy the modified ODE

$$\frac{dy}{d\theta} = F_*(y,\theta), \tag{4.6}$$

where $F_*(x,\theta) = (-1 + (l-1)(\mathfrak{p}_1^* - \mathfrak{p}_0^*),\,-(l-1)\mathfrak{p}_1^*)$ and $\mathfrak{p}_a^*$ are obtained by
replacing $\max(x_1,0)$ in (2.2) and (2.3) with $x_1$. Modifying the set $q(\epsilon)$ in the
same manner, it is easy to verify that the statement and proof of Lemma 4.1
remain valid for $\mathfrak{p}_a^*(x,\theta)$ (apart from the fact that the latter are not $[0,1]$
valued). We also find that $\theta_\epsilon = 1-\epsilon$ for the ODE (4.6), from which we
can deduce that the latter ODE also admits a unique solution subject to
the initial condition (2.4). Further, the preceding computations show that
for every $\rho > 0$ the solution of (4.6) starting at (2.4) is given by (4.1) and
(4.2). In particular, at $\rho = \rho_0$ this is also the solution of the ODE (2.5) on
$[0,\theta_*(\rho_0)+\delta]$. However, by definition of $\theta_*(\rho)$, necessarily $y_1(\theta,\rho_0)$ of (4.1)
is nonpositive for some $\theta \in [\theta_*(\rho_0),\theta_*(\rho_0)+\delta]$, resulting with the desired
contradiction.

(d) Simple calculus shows that either $u \mapsto h_\rho(u)$ is monotone increasing
and positive on $(0,\infty)$, which happens for all $\rho$ large enough, or $h_\rho'(u) = 0$
has exactly two positive solutions, $u_1 = u_1(\rho)$ corresponding to a local
maximum of $h_\rho$ and $u_2 = u_2(\rho) > u_1$ corresponding to a local minimum of
$h_\rho$. With $h_\rho(0) = 0$ and $h_\rho(\cdot)$ positive on $[1,\infty)$, while $h_\rho(u_2(\rho)) < 0$ for all
$\rho > 0$ small enough, it follows from the definition of $\rho_c$ that $h_{\rho_c}(u_2) = 0$ at
$u_2 = u_2(\rho_c) \in (0,1)$ and $h_{\rho_c}(u)$ is positive at any positive $u \ne u_2$. Hence, by
definition $\theta_-(\rho_c) = 1$ while $\theta_*(\rho_c) = 1 - u_2(\rho_c)^l \in (0,1)$. From part (b) of
the proposition we thus have that at $\rho = \rho_c$ the function $y_1(\theta)$ is infinitely
continuously differentiable, with $y_1(\theta) = h(u(\theta))$ for $h(u) \equiv l u^{l-1}h_{\rho_c}(u)$ [cf.
(4.1)]. In particular, $y_1(\theta)$ is then zero when $\theta = \theta_c$ or $\theta = 1$ and positive
elsewhere (per the preceding analysis of $h_{\rho_c}$). Further, at $\theta = \theta_c$ we have
$u(\theta) = u_2(\rho_c)$, an isolated minimizer of $h(u)$, and as $u'(\theta_c) < 0$, it follows
by elementary calculus that $y_1'(\theta_c) = 0$ and $y_1''(\theta_c) > 0$. Also,
$h(u) = l u^l(1 + O(u^{l-2}))$ for small $u$, hence $y_1(1-\delta) = l\delta + o(\delta)$ at $\rho = \rho_c$. □

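We conclude this section by showing that the discrete recursions
corresponding to the mean and covariance of the process $z'(\cdot)$ of (2.12) are near
the solution of the relevant ODEs (at least for $\rho$ near $\rho_c$ and up to time
$\tau_n \equiv \lfloor n\theta_c - n^\beta \rfloor$). More precisely, for $\tilde{\mathbb{A}}_\tau \equiv \mathbb{I}_{\tau<\tau_n}\mathbb{A}(y(\tau/n,\rho),\tau/n)$, let

$$y^*(\tau+1) = y^*(\tau) + n^{-1}\tilde{\mathbb{A}}_\tau\,(y^*(\tau) - y(\tau/n)) + n^{-1}F(y(\tau/n),\tau/n), \tag{4.7}$$

starting at $y^*(0) = y(0,\rho)$, and consider the positive definite matrices

$$\mathbb{Q}_\tau = \tilde{\mathbb{B}}_0^{\tau-1}\,\mathbb{Q}(0,\rho)\,(\tilde{\mathbb{B}}_0^{\tau-1})^\dagger + n^{-1}\sum_{\sigma=0}^{\tau-1}\tilde{\mathbb{B}}_{\sigma+1}^{\tau-1}\,\mathbb{G}(y(\sigma/n),\sigma/n)\,(\tilde{\mathbb{B}}_{\sigma+1}^{\tau-1})^\dagger \tag{4.8}$$

for $\tilde{\mathbb{B}}_\sigma^\tau$ of (2.13). Then: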



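Lemma 4.3. Fixing $\beta \in (3/4,1)$ and $\beta' < 2\beta - 1$, we have for all $n$ large
enough and $|\rho - \rho_c| \le n^{\beta'-1}$,

$$\Big| n^{1/2}y_1^*(\tau_n) - \frac{\tilde F}{2}\,n^{2\beta-3/2} - n^{1/2}(\rho-\rho_c)\frac{\partial y_1}{\partial\rho}(\theta_c,\rho_c) \Big| \le C n^{3\beta-5/2}, \tag{4.9}$$

the matrices $\{\tilde{\mathbb{B}}_\sigma^\tau : \sigma,\tau \le n\}$ and their inverses are uniformly bounded with
respect to the $L_2$-operator norm (denoted $\|\cdot\|$), and

$$\|\mathbb{Q}_{\tau_n} - \mathbb{Q}(\theta_c,\rho_c)\| \le C n^{\beta-1}, \tag{4.10}$$

for some finite $C = C(\beta,\beta')$ and all $n$.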

Proof. Recall part (a) of Proposition 4.2 that $(y(\theta,\rho),\theta) \in q(\epsilon)$ for
$\theta \le 1-2\epsilon$ and $\rho \in [\epsilon,1/\epsilon]$. Thus, fixing $\beta$, $\beta'$ and $0 < \epsilon < (1-\theta_c)/2$, it follows
that the operator norm of $\tilde{\mathbb{A}}_\tau$ is uniformly bounded over $\tau \le \tau_n$,
$|\rho-\rho_c| \le n^{\beta'-1}$ and $n \ge n_0$ (hereafter $n_i$ and $c_i$, $i = 0,1,\dots$, are two
nondecreasing sequences of finite constants, each depending only on $\beta$, $\beta'$
and $\epsilon$). Consequently, the matrices $\tilde{\mathbb{B}}_\sigma^\tau$ of (2.13) and their inverses are also
uniformly bounded with respect to the $L_2$ operator norm for $n \ge n_1$, $\sigma$, $\tau$ and
$\rho$ as before.

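We proceed to show that $\{y^*(\tau),\tau \le \tau_n\}$ is close to the solution $y(\cdot,\rho)$ of
the ODE (2.5). To this end, let $D_n^*(\tau) \equiv y^*(\tau) - y(\tau/n,\rho)$, noting that by
definition $D_n^*(\tau+1) = (\mathbb{I} + n^{-1}\tilde{\mathbb{A}}_\tau)D_n^*(\tau) + \xi_n^*(\tau)$ for $\tau \ge 0$, with $D_n^*(0) = 0$
and

$$\xi_n^*(\tau) \equiv \int_{\tau/n}^{(\tau+1)/n}\big[F(y(\tau/n),\tau/n) - F(y(\theta),\theta)\big]\,d\theta.$$

By the Lipschitz continuity of $(\theta,\rho) \mapsto y$ on $[0,1-\epsilon] \times [\epsilon,1/\epsilon]$ (see
Proposition 4.2), we know that $\|y(\theta) - y(\tau/n)\| \le c_0/n$ for some finite $c_0$, all
$\theta \in [\tau/n,\tau/n+1/n]$ and any $\tau \le (1-\epsilon)n$. Further, since $\|F(x,\theta)\| \le 2l$ and
$(x,\theta) \mapsto F = (-1 + (l-1)(\mathfrak{p}_1 - \mathfrak{p}_0),\,-(l-1)\mathfrak{p}_1)$ is Lipschitz continuous on
$q(\epsilon)$ (see Lemma 4.1), we deduce that for some finite constant $c_* = c_*(l,\epsilon)$,
all $n$, $\tau \le (1-\epsilon)n$ and $\rho \in [\epsilon,1/\epsilon]$,

$$\|\xi_n^*(\tau)\| \le \frac{1}{n}\sup_{\theta\in[\tau/n,\tau/n+1/n]}\|F(y(\theta),\theta) - F(y(\tau/n),\tau/n)\| \le c_*n^{-2}. \tag{4.11}$$

Since $D_n^*(\tau) = \sum_{\sigma=0}^{\tau-1}\tilde{\mathbb{B}}_{\sigma+1}^{\tau-1}\xi_n^*(\sigma)$, and $\|\tilde{\mathbb{B}}_\sigma^\tau\|$ are uniformly bounded, we
deduce that

$$\sup_{n\ge n_2}\ \sup_{|\rho-\rho_c|\le n^{\beta'-1}}\ \sup_{\tau\le\tau_n}\ n\,\|y^*(\tau) - y(\tau/n,\rho)\| \le c_1 < \infty. \tag{4.12}$$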

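Let $\theta_n \equiv \tau_n/n$, $\Delta\theta_n \equiv \theta_n - \theta_c = -n^{\beta-1}$ and $\Delta\rho \equiv \rho - \rho_c$. Note that
$y_1(\theta,\rho_c) \ge c(\Delta\theta_n)^2$ for some $c > 0$, all $n$ and $\theta \in [0,\theta_n]$ [see part (d) of
Proposition 4.2]. Further, recall that $|\Delta\rho| \le n^{\beta'-1} = o((\Delta\theta_n)^2)$ by our choice of
$\beta' < 2\beta - 1$. The Lipschitz continuity of $\rho \mapsto y(\theta,\rho)$ for $\theta \le \theta_-(\rho)$ thus implies
that both $(\theta_c,\rho_c)$ and $(\theta_n,\rho)$ for $n \ge n_4$ and $|\rho-\rho_c| \le n^{\beta'-1}$ are in the set
$\mathcal{A}_\epsilon \equiv \{(\theta,\rho) : \theta \le \min(\theta_-(\rho),1-\epsilon),\ \epsilon \le \rho \le 1/\epsilon\}$ where $(\theta,\rho) \mapsto y$ is infinitely
continuously differentiable [see part (b) of Proposition 4.2]. Hence, by Taylor
expanding $y_1(\cdot)$ around $(\theta_c,\rho_c)$ where $y_1 = \partial y_1/\partial\theta = 0$, we obtain that for some
$c_2'$, $c_2$ and all $n$,

$$\Big| y_1(\theta_n,\rho) - \Delta\rho\,\frac{\partial y_1}{\partial\rho} - \frac{1}{2}(\Delta\theta_n)^2\frac{\partial^2 y_1}{\partial\theta^2} \Big| \le c_2'\big(|\Delta\rho|^2 + |\Delta\rho|\,|\Delta\theta_n| + |\Delta\theta_n|^3\big) \le c_2 n^{3(\beta-1)} \tag{4.13}$$

[with all partial derivatives evaluated at $(\theta_c,\rho_c)$]. Recall that $F = dy/d\theta$, so
$\tilde F$ equals $\partial^2 y_1/\partial\theta^2$ evaluated at $(\theta_c,\rho_c)$, and the left-hand side of (4.9) is
bounded above by

$$n^{1/2}\|y^*(\tau_n) - y(\tau_n/n,\rho)\| + n^{1/2}\Big| y_1(\theta_n,\rho) - \Delta\rho\,\frac{\partial y_1}{\partial\rho} - \frac{1}{2}(\Delta\theta_n)^2\frac{\partial^2 y_1}{\partial\theta^2} \Big|.$$

Thus, controlling the first term via (4.12) and the second term via (4.13)
yields the bound of (4.9).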

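Turning now to the proof of (4.10), recall that the solution $\mathbb{Q}(\cdot)$ of (2.8) is
Lipschitz continuous in $(\theta,\rho)$ on the set $\mathcal{A}_\epsilon$ [see part (b) of Proposition 4.2].
As both $(\theta_c,\rho_c)$ and $(\theta_n,\rho)$, $n \ge n_4$ are in this set, it follows that for some
finite $c_3'$, $c_3$ and all $n$,

$$\|\mathbb{Q}(\theta_n,\rho) - \mathbb{Q}(\theta_c,\rho_c)\| \le c_3'(|\Delta\theta_n| + |\Delta\rho|) \le c_3 n^{\beta-1}$$

(recall that $\beta' < 2\beta - 1 < \beta$). Further, $\mathbb{Q}(\theta,\rho)$ is given by (4.5), where the
matrices $\mathbb{G}(y(\zeta),\zeta)$ are bounded and Lipschitz continuous in $\zeta$ (with respect
to the $L_2$ operator norm) uniformly in $n \ge n_4$ and $\zeta \le \theta_n$. The same uniform
boundedness applies for $\mathbb{Q}(0,\rho)$ and $\mathbb{B}_\zeta^\theta$, $0 \le \zeta \le \theta \le 1-\epsilon$ [see proof of part
(a) of Proposition 4.2]. Hence, comparing (4.5) and (4.8) we thus deduce
that (4.10) is an immediate consequence of

$$\sup_{0\le\zeta\le\theta_n}\big\|\tilde{\mathbb{B}}_{\lfloor n\zeta\rfloor}^{\tau_n-1} - \mathbb{B}_\zeta^{\theta_n}\big\| \le c_4 n^{-1}, \tag{4.14}$$

holding for some finite $c_4$ and all $n$. To this end, let
$D_n(\sigma,\tau) \equiv \|\tilde{\mathbb{B}}_\sigma^{\tau-1} - \mathbb{B}_{\sigma/n}^{\tau/n}\|$, noting that by the definition of $\tilde{\mathbb{B}}_\sigma^\tau$ and $\mathbb{B}_\zeta^\theta$ we have
that $D_n(\sigma,\sigma) = 0$ and for all $\tau \ge \sigma$,

$$D_n(\sigma,\tau+1) \le D_n(\sigma,\tau) + n^{-1}\sup_{\theta\in[\tau/n,(\tau+1)/n]}\big\|\tilde{\mathbb{A}}_\tau\tilde{\mathbb{B}}_\sigma^{\tau-1} - \mathbb{A}(y(\theta,\rho),\theta)\,\mathbb{B}_{\sigma/n}^\theta\big\|. \tag{4.15}$$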

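As $(y(\theta,\rho),\theta)$, $\theta \le \theta_n$ and $n \ge n_4$ are in the set $q_+(\epsilon)$ in which
$(x,\theta) \mapsto \mathbb{A}(x,\theta)$ is bounded and Lipschitz continuous (for the operator norm), it
follows that for some $c_5$ finite and all $n$,

$$\sup_{\tau<\tau_n}\ \sup_{\theta\in[\tau/n,(\tau+1)/n]}\|\tilde{\mathbb{A}}_\tau - \mathbb{A}(y(\theta,\rho),\theta)\| \le c_5 n^{-1}. \tag{4.16}$$

Further, with $\|\mathbb{A}(y(\theta,\rho),\theta)\|$ bounded uniformly in $(\theta,\rho)$, we have from (4.4)
the existence of $c_6$ finite, such that

$$\|\mathbb{B}_\zeta^\theta - \mathbb{B}_{\zeta'}^{\theta'}\| \le c_6(|\theta-\theta'| + |\zeta-\zeta'|), \tag{4.17}$$

for any $\rho \in [\epsilon,1/\epsilon]$, $0 \le \zeta \le \theta \le 1-\epsilon$, and $0 \le \zeta' \le \theta' \le 1-\epsilon$. So, with $\tilde{\mathbb{A}}_\tau$,
$\tilde{\mathbb{B}}_\sigma^{\tau-1}$ and $\mathbb{A}(\cdot)$ uniformly bounded, by the Lipschitz properties (4.16) and
(4.17) we have that

$$\big\|\tilde{\mathbb{A}}_\tau\tilde{\mathbb{B}}_\sigma^{\tau-1} - \mathbb{A}(y(\theta,\rho),\theta)\,\mathbb{B}_{\sigma/n}^\theta\big\| \le c_7 D_n(\sigma,\tau) + c_8 n^{-1},$$

for some $c_7$, $c_8$ finite and all $n$, $\sigma \le \tau < \tau_n$ and $\theta \in [\tau/n,(\tau+1)/n]$. Plugging
this bound in (4.15) we have that $D_n(\sigma,\tau+1) \le (1 + c_7n^{-1})D_n(\sigma,\tau) + c_8n^{-2}$,
from which we deduce that for some $c_9$ finite and all $n$,

$$\max_{0\le\sigma\le\tau\le\tau_n} D_n(\sigma,\tau) \le c_9 n^{-1}.$$

By (4.17), this yields the bound (4.14), hence completing the proof of (4.10)
and that of the lemma. □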
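
The last step of the proof is a discrete Gronwall argument: a recursion of the form $D(\tau+1) \le (1 + c_7/n)D(\tau) + c_8/n^2$ started from zero stays of order $1/n$ over $n$ steps. The sketch below illustrates this with arbitrary constants; the function name and the values of `c7` and `c8` are assumptions of this example. The explicit bound follows from summing the geometric series, $D(\tau) \le \frac{c_8}{c_7 n}\big((1+c_7/n)^\tau - 1\big) \le \frac{c_8}{c_7 n}(e^{c_7}-1)$ for $\tau \le n$.

```python
import math


def iterate_bound(n, c7, c8, steps):
    """Largest value reached by D(t+1) = (1 + c7/n) D(t) + c8/n^2 with D(0) = 0."""
    d = 0.0
    worst = 0.0
    for _ in range(steps):
        d = (1.0 + c7 / n) * d + c8 / n ** 2
        worst = max(worst, d)
    return worst


def gronwall_constant(c7, c8):
    """Constant c9 such that n * D(t) <= c9 for all t <= n."""
    return c8 / c7 * (math.exp(c7) - 1.0)
```

Running the recursion with equality gives the worst case, so the same constant bounds any sequence satisfying the inequality.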

4.2. Asymptotic enumeration of the graph ensemble. Here we show that
the initial distribution of the Markov chain $z(\cdot)$ of Section 3 is well
approximated by a multivariate Gaussian law of mean $ny(0)$ and positive definite
covariance matrix $n\mathbb{Q}(0)$, with the rescaled mean $y(0)$ and covariance $\mathbb{Q}(0)$
given by the initial conditions of the corresponding ODEs, namely, (2.4) and
(2.10), respectively.

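Lemma 4.4. For $x \in \mathbb{R}^d$ and a positive definite $d$-dimensional matrix
$A$, let $G_d(\cdot|x;A)$ denote the $d$-dimensional normal density of mean $x$ and
covariance $A$. Further, let $z = (z_1,z_2)$ denote the number of c-nodes of degree
1 and of degree strictly greater than 1 in a random graph from the
$\mathcal{G}_l(n,\lfloor n\rho\rfloor)$ ensemble. Then, for any $\epsilon > 0$ there exist finite, positive
constants $\kappa_0$, $\kappa_1$, $\kappa_2$ and $\kappa_3$, such that for all $n$, $r$, and $\rho \in [\epsilon,1/\epsilon]$,

$$\|\mathbb{E}z - ny(0)\| \le \kappa_0, \tag{4.18}$$

$$\mathbb{P}\{\|z - \mathbb{E}z\| \ge r\} \le \kappa_1 e^{-r^2/(\kappa_2 n)}, \tag{4.19}$$

$$\sup_{u\in\mathbb{R}^2}\ \sup_{x\in\mathbb{R}}\ \Big|\mathbb{P}\{u\cdot z \le x\} - \int_{u\cdot z\le x} G_2(z|ny(0);n\mathbb{Q}(0))\,dz\Big| \le \kappa_3 n^{-1/2}. \tag{4.20}$$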
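
The first claim of the lemma, that $\mathbb{E}z$ is within a bounded distance of $ny(0)$, can be probed by simulation. The sketch below is illustrative only: the function names are hypothetical, the graph is sampled by throwing $nl$ edges independently and uniformly onto $m$ c-nodes, and $y(0) = \rho\,(\mathbb{P}_\gamma\{N_1 = 1\},\mathbb{P}_\gamma\{N_1 \ge 2\})$ with $N_1$ a Poisson($\gamma$) variable and $\gamma = l/\rho$, as recalled in the proof.

```python
import math
import random


def degree_counts(n, m, l, rng):
    """(z1, z2): numbers of c-nodes of degree 1 and of degree >= 2."""
    deg = [0] * m
    for _ in range(n * l):
        deg[rng.randrange(m)] += 1
    z1 = sum(1 for d in deg if d == 1)
    z2 = sum(1 for d in deg if d >= 2)
    return z1, z2


def rescaled_mean(rho, l):
    """y(0) = rho * (P{N = 1}, P{N >= 2}) for N ~ Poisson(l / rho)."""
    gamma = l / rho
    p1 = gamma * math.exp(-gamma)
    p2 = 1.0 - math.exp(-gamma) - p1
    return rho * p1, rho * p2


def average_counts(n, m, l, samples, seed=1):
    """Empirical mean of (z1, z2) over independent graphs."""
    rng = random.Random(seed)
    tot1 = tot2 = 0
    for _ in range(samples):
        z1, z2 = degree_counts(n, m, l, rng)
        tot1 += z1
        tot2 += z2
    return tot1 / samples, tot2 / samples
```

With $n = 2000$, $m = 2400$ and $l = 3$, the empirical averages should fall within sampling error of $ny(0)$, whose entries are roughly 492.5 and 1710.5.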



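Proof. Set $m \equiv \lfloor n\rho\rfloor$ and $\gamma \equiv l/\rho$. Recall that the description of the
ensemble $\mathcal{G}_l(n,m)$ in Section 3.1 provides the following expression for the
probability $\mathbb{P}(z)$ of having exactly $z_1$ c-nodes of degree 1 and $z_2$ c-nodes of
degree strictly greater than 1:

$$\mathbb{P}(z) = \frac{h(z,0)}{m^{nl}} = \frac{\mathbb{P}_\gamma\{S_m = (z_1,z_2,nl)\}}{\mathbb{P}_\gamma\{S_m^{(3)} = nl\}}, \tag{4.21}$$

where $S_m \equiv \sum_{i=1}^m X_i$ for $X_i \equiv (\mathbb{I}_{N_i=1},\mathbb{I}_{N_i\ge2},N_i) \in \mathbb{Z}_+^3$ and $N_i$ that are i.i.d.
Poisson($\gamma$) random variables. Consequently,

$$\mathbb{E}z_1 = m\,\mathbb{P}_\gamma\{N_1 = 1\}\,\frac{\mathbb{P}_\gamma\{S_{m-1}^{(3)} = nl-1\}}{\mathbb{P}_\gamma\{S_m^{(3)} = nl\}}, \qquad \mathbb{E}z_2 = m - \mathbb{E}z_1 - m\,\mathbb{P}_\gamma\{N_1 = 0\}\,\frac{\mathbb{P}_\gamma\{S_{m-1}^{(3)} = nl\}}{\mathbb{P}_\gamma\{S_m^{(3)} = nl\}}.$$

With $|\rho n - m| \le 1$ and $y(0)$ of (2.4) such that
$ny(0) = n\rho\,(\mathbb{P}_\gamma\{N_1 = 1\},\mathbb{P}_\gamma\{N_1 \ge 2\})$, we easily get (4.18) upon using the fact
that $S_k^{(3)}$ is a Poisson($k\gamma$) random variable and the sequence
$m|e(1-\frac{1}{m})^m - 1|$ is uniformly bounded. By (4.18), in deriving (4.19) we may
and shall replace $\mathbb{E}z$ by $\frac{m}{\rho}y(0) = (\mathbb{E}S_m^{(1)},\mathbb{E}S_m^{(2)})$. In view of (4.21), the stated
bound (4.19) is then merely

$$\mathbb{P}_\gamma\big\{|S_m^{(1)} - \mathbb{E}S_m^{(1)}|^2 + |S_m^{(2)} - \mathbb{E}S_m^{(2)}|^2 \ge r^2 \,\big|\, S_m^{(3)} = nl\big\} \le \kappa_1 e^{-r^2/(\kappa_2 n)},$$

which is an immediate consequence of Hoeffding's inequality for the partial
sums $(S_m^{(1)},S_m^{(2)})$ and the uniform lower bound $\mathbb{P}_\gamma\{S_m^{(3)} = nl\} \ge cn^{-1/2}$ with
$c > 0$ depending only on $\epsilon$ and $l$.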

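Observe next that $X_i$ are nondegenerate lattice random variables on $\mathbb{R}^3$,
having minimal lattice $\mathbb{Z}^3$, finite moments of all orders and such that

$$\mathrm{cov}(X_i) = \mathbb{V} \equiv \begin{pmatrix} p_1(1-p_1) & -p_1p_{\ge2} & p_1(1-\gamma)\\ -p_1p_{\ge2} & p_{\ge2}(1-p_{\ge2}) & \gamma - p_1 - \gamma p_{\ge2}\\ p_1(1-\gamma) & \gamma - p_1 - \gamma p_{\ge2} & \gamma \end{pmatrix}$$

with $p_1 = \mathbb{P}_\gamma(N_1 = 1) = \gamma e^{-\gamma}$ and $p_{\ge2} = \mathbb{P}_\gamma(N_1 \ge 2) = 1 - e^{-\gamma} - \gamma e^{-\gamma}$. Thus,
upon bounding $(1 + \|u\|^3)P_1(-G_3(\cdot|0,\mathbb{V}) : \{\chi_\nu\})(u)$ for the correction term $P_1$
of [7], (7.19) (with $\chi_\nu$ denoting the cumulants of the law of $X_1$), uniformly
in $\gamma \in [\epsilon',1/\epsilon']$ and $u \in \mathbb{R}^3$, it follows from Corollary 22.3 of [7] (with $s = 3$
there), that for some finite $c = c(\epsilon')$, any such $\gamma$, all $m$ and $z \in \mathbb{Z}^2$,

$$\big|\mathbb{P}_\gamma\{S_m = z_e\} - G_3(z_e|mx_e;m\mathbb{V})\big| \le \frac{c\,m^{-2}}{1 + \|m^{-1/2}(z_e - mx_e)\|^3}, \tag{4.22}$$

where $z_e \equiv (z,nl)$ and $x_e \equiv \rho^{-1}(y(0),l) = \mathbb{E}_\gamma X_1$. Applying the same
argument for $S_m^{(3)} \in \mathbb{R}$, and possibly enlarging $c(\epsilon')$ as needed we further have
that

$$\big|\mathbb{P}_\gamma\{S_m^{(3)} = nl\} - G_1(nl|m\gamma;m\mathbb{V}_{33})\big| \le c\,m^{-1}. \tag{4.23}$$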

Next, summing the bound of (4.22) over $z \in \mathbb{Z}^2$, we deduce that for some
finite $c' = c'(\epsilon')$, any $\gamma$ and $m$,

$$\sum_z \big|\mathbb{P}_\gamma\{S_m = z_e\} - G_3(z_e|mx_e;m\mathbb{V})\big| \le c'm^{-1}. \tag{4.24}$$

Further, $\mathbb{P}_\gamma\{S_m^{(3)} = nl\} = \sum_z \mathbb{P}_\gamma\{S_m = z_e\}$, hence we get from (4.21) and the
bounds of (4.23) and (4.24) that for some finite $\kappa = \kappa(\epsilon')$ and any $\gamma$ and $m$,

$$\sum_z \Big|\mathbb{P}(z) - \frac{G_3(z_e|mx_e;m\mathbb{V})}{G_1(nl|m\gamma;m\mathbb{V}_{33})}\Big| \le \frac{c\,m^{-1} + c'm^{-1}}{G_1(nl|m\gamma;m\mathbb{V}_{33})} \le \kappa n^{-1/2} \tag{4.25}$$

[with the rightmost inequality due to the uniform lower bound on
$m^{1/2}G_1(nl|m\gamma;m\mathbb{V}_{33})$ for $|nl - m\gamma| \le l/\epsilon'$]. The ratio $G_3(\cdots)/G_1(\cdots)$ appearing
in (4.25) is the conditional distribution of $(z_1,z_2)$, given $z_3 = nl$, under the
(joint) law $G_3(\cdots)$, which is thus a Gaussian distribution of mean $n'y(0)$ and the
positive definite covariance matrix $n'\widehat{\mathbb{V}}$, with $n' \equiv m/\rho$ and the entries of the
two-dimensional matrix $\widehat{\mathbb{V}}$ given by $\widehat{\mathbb{V}}_{ij} = \rho\,[\mathbb{V}_{ij} - \mathbb{V}_{i3}\mathbb{V}_{j3}/\mathbb{V}_{33}]$. Upon
substituting the expressions for $p_1$ and $p_{\ge2}$, we see that $\widehat{\mathbb{V}}$ coincides with $\mathbb{Q}(0)$ of
(2.10).
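
The conditional covariance just described is a Schur complement of the three-dimensional covariance matrix, and it is easy to evaluate numerically. The sketch below is illustrative; the function names are hypothetical and the matrix entries are those of the covariance of $X_i = (\mathbb{I}_{N_i=1},\mathbb{I}_{N_i\ge2},N_i)$ for $N_i$ a Poisson($\gamma$) variable. It does not check the identification with (2.10), only that the resulting matrix is symmetric and positive definite.

```python
import math


def covariance_V(gamma):
    """3x3 covariance of (1{N=1}, 1{N>=2}, N) for N ~ Poisson(gamma)."""
    p1 = gamma * math.exp(-gamma)
    p2 = 1.0 - math.exp(-gamma) - p1
    return [
        [p1 * (1 - p1), -p1 * p2, p1 * (1 - gamma)],
        [-p1 * p2, p2 * (1 - p2), gamma - p1 - gamma * p2],
        [p1 * (1 - gamma), gamma - p1 - gamma * p2, gamma],
    ]


def conditional_covariance(rho, l):
    """rho * (V_ij - V_i3 V_j3 / V_33) for i, j in {1, 2}, with gamma = l / rho."""
    V = covariance_V(l / rho)
    return [[rho * (V[i][j] - V[i][2] * V[j][2] / V[2][2]) for j in range(2)]
            for i in range(2)]
```

Conditioning on the third coordinate always reduces the variances of the first two, which is visible by comparing the diagonal of the output with $\rho$ times the diagonal of the unconditioned matrix.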

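So, it follows from (4.25) that

$$\sup_{u\in\mathbb{R}^2}\ \sup_{x\in\mathbb{R}}\ \Big|\mathbb{P}\{u\cdot z \le x\} - \sum_{u\cdot z\le x} G_2(z|n'y(0);n'\mathbb{Q}(0))\Big| \le \kappa n^{-1/2}.$$

We thus arrive at (4.20) upon observing first that

$$\sup_{h\le1}\ \sup_{u\in\mathbb{R}^2}\ \sup_{x\in\mathbb{R}}\ \frac{1}{h}\,\Big|\sum_{u\cdot z\le x} G_2(z|h^{-2}y(0);h^{-2}\mathbb{Q}(0)) - \int_{u\cdot z\le x} G_2(z|h^{-2}y(0);h^{-2}\mathbb{Q}(0))\,dz\Big|$$

is uniformly bounded in $\gamma$ by the Euler-MacLaurin sum formula (cf.
Theorem A.4.3 of [7] for the Schwartz function $G_2(\cdot|0;\mathbb{Q}(0))$, where the correction
in $\Lambda_1(x)$ of [7], (A.4.20), to the Gaussian distribution is then at most $\kappa'h$ for
a finite $\kappa'(\epsilon)$, all $x \in \mathbb{R}^2$ and $\gamma$), then noting that
$\sqrt{n}\,\sup_u\sup_x|G(u,x;n) - G(u,x;n')|$ is bounded in $\gamma$, $n$ and $|n'-n| \le 1/\epsilon$ for the
Gaussian distribution function $G(u,x;r) \equiv \int_{u\cdot z\le x} G_2(z|ry(0);r\mathbb{Q}(0))\,dz$. □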

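4.3. Asymptotic transition probabilities. We next prove an approximated
formula for the transition probabilities $W_\tau^+(\Delta z|z)$, that we often use in
the sequel. This formula is valid throughout
$\mathcal{Q}_+(\epsilon) \equiv \mathcal{Q}(\epsilon) \cap \{z_1 \ge 1\} \subset \mathbb{Z}^3$, where for each $\epsilon > 0$,

$$\mathcal{Q}(\epsilon) \equiv \{(z,\tau) : -nl + n\epsilon \le z_1;\ n\epsilon \le z_2;\ 0 \le \tau \le n(1-\epsilon);\ n\epsilon \le (n-\tau)l - \max(z_1,0) - 2z_2\}$$

is a finite subset of $\mathbb{Z}^3$. As many of our approximations involve the rescaled
variables $x \equiv n^{-1}z$ and $\theta \equiv \tau/n$, we note in passing that if $(z,\tau) \in \mathcal{Q}(\epsilon)$, then
necessarily $(x,\theta)$ is in the set $q(\epsilon)$ of Lemma 4.1 and if further
$(z,\tau) \in \mathcal{Q}_+(\epsilon)$, then also $(x,\theta) \in q_+(\epsilon)$.

Lemma 4.5. For each $\theta \in [0,1)$ let $K_\theta : \mathbb{R}^2 \to \mathcal{K}_\theta$ denote the projection
onto the convex set $\mathcal{K}_\theta \equiv \{x \in \mathbb{R}_+^2 : x_1 + 2x_2 \le l(1-\theta)\}$. Recall that
each $\theta \in [0,1)$ and $x \in \mathcal{K}_\theta$ specifies by (2.2) a well-defined probability vector
$(\mathfrak{p}_0,\mathfrak{p}_1,\mathfrak{p}_2)$. For such $\theta$, $x$ define the transition kernel

$$W_\theta(\Delta z|x) \equiv \binom{l-1}{q_0-1,\,q_1,\,q_2}\,\mathfrak{p}_0^{\,q_0-1}\mathfrak{p}_1^{\,q_1}\mathfrak{p}_2^{\,q_2}, \tag{4.26}$$

where $q_0 = -\Delta z_1 - \Delta z_2 \ge 1$, $q_1 = -\Delta z_2 \ge 0$, $q_2 = l + \Delta z_1 + 2\Delta z_2 \ge 0$. For
any $x \in \mathbb{R}^2$, set $\widehat W_\theta(\cdot|x) \equiv W_\theta(\cdot|K_\theta(x))$. That is, $\Delta z_1 = -1 - (q_0-1) + q_1$ and
$\Delta z_2 = -q_1$, with $(q_0-1,q_1,q_2)$ having the multinomial law of parameters $l-1$,
$\mathfrak{p}_0$, $\mathfrak{p}_1$, $\mathfrak{p}_2$ that correspond to the projection of $x$ onto $\mathcal{K}_\theta$.

Then, there exists a positive constant $C = C(l,\epsilon)$, such that, for any
$\rho \in [\epsilon,1/\epsilon]$, $(z,\tau) \in \mathcal{Q}_+(\epsilon)$, $\Delta z_1 \in \{-l,\dots,l-2\}$, $\Delta z_2 \in \{-(l-1),\dots,0\}$, and
all $n$,

$$\big|W_\tau^+(\Delta z|z) - \widehat W_{\tau/n}(\Delta z|n^{-1}z)\big| \le \frac{C}{n}.$$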
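
The kernel (4.26) is a multinomial law in disguise, which makes it easy to tabulate. The sketch below is an illustration with a hypothetical function name; it takes the probability vector $(\mathfrak{p}_0,\mathfrak{p}_1,\mathfrak{p}_2)$ as given rather than computing it from (2.2). It can be used to confirm that the kernel sums to one and that its mean increment is $(-1 + (l-1)(\mathfrak{p}_1 - \mathfrak{p}_0),\,-(l-1)\mathfrak{p}_1)$, the drift $F$ of the ODE (2.5).

```python
from math import factorial


def kernel(p, l):
    """Return {(dz1, dz2): probability} for the multinomial kernel (4.26)."""
    p0, p1, p2 = p
    out = {}
    for a in range(l):                 # a = q0 - 1
        for q1 in range(l - a):
            q2 = l - 1 - a - q1
            weight = factorial(l - 1) // (
                factorial(a) * factorial(q1) * factorial(q2))
            prob = weight * p0 ** a * p1 ** q1 * p2 ** q2
            dz = (-1 - a + q1, -q1)    # (dz1, dz2) determined by (q0, q1)
            out[dz] = out.get(dz, 0.0) + prob
    return out


def mean_increment(p, l):
    """Expected (dz1, dz2) under the kernel."""
    k = kernel(p, l)
    m1 = sum(dz[0] * pr for dz, pr in k.items())
    m2 = sum(dz[1] * pr for dz, pr in k.items())
    return m1, m2
```

For example, with $l = 4$ and $(\mathfrak{p}_0,\mathfrak{p}_1,\mathfrak{p}_2) = (0.2,0.3,0.5)$ the mean increment is $(-0.7,-0.9)$.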

Proof. Following the notation of Lemma 3.1, for each $q \equiv (p_0,q_0,q_1,q_2) \in \mathcal{D}$,
let

$$c_l(q) \equiv \binom{p_0+q_1+q_2}{p_0,\,q_1,\,q_2}\,\mathrm{coeff}[(e^x-1-x)^{p_0}(e^x-1)^{q_1+q_2},\,x^{l-q_0}], \tag{4.27}$$

and for $z = (z_1,z_2)$ let

$$g_l(z) \equiv \sum_{q\in\mathcal{D}}\binom{z_1-1}{q_0-1}\binom{z_2}{p_0+q_1+q_2}\,c_l(q).$$

Using the identities $z_0 = z_0' - q_0 - p_0$, $z_1' - q_1 = z_1 - q_0$ and
$z_2' - q_2 = z_2 - p_0 - q_1 - q_2$ of (3.3), it follows after elementary algebra that $g_l(z)$
equals the sum over $\mathcal{D}$ in (3.2) times the term $\binom{m}{z_1',z_2',\cdot}\big/\binom{m}{z_1,z_2,\cdot}$.

(4.28) pxit, s) = coeff [(e^ - 1 - x)*,x^]A"(e^ - 1 - A)"* 

is precisely 



Px{t,s)=¥xi^Y^^Ni = s^, 



where {Ni} are i.i.d. random variables, with Pa(-^i = k) = ¥{Nx = k\Nx > 
2) and Nx a Poisson random variable of parameter A > 0. It is not hard to 
explicitly compute 

A(e^-l) 



/l(A)=EA(iV: 



e 



^-1-A' 
A 



/2(A)2 = Var,(iVi) = ^-^_^__^[(e^ - 1)2 _ A^e^], 
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and the normahzed kth moment /fc(A) = EA(iVi - fi{\)Y/f2W^, k>3. 

The behavior of /i was aheady considered in the proof of Lemma 4.1. 
Moreover /2 : M_|_ — > M_|_ is bounded away from zero and infinity when A is 
bounded away from zero and infinity, respectively, resulting with /fc(A) that 
are also bounded away from infinity for each k. 
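
The closed forms for the mean and variance of a Poisson variable conditioned to be at least 2 can be confirmed by direct summation. The sketch below is illustrative, with hypothetical function names; the truncation level `kmax` is an assumption chosen large enough for moderate values of $\lambda$.

```python
import math


def truncated_poisson_moments(lam, kmax=200):
    """Mean and variance of Poisson(lam) conditioned on being >= 2, by summation."""
    weights = {}
    term = math.exp(-lam)              # P{N = 0}
    for k in range(kmax + 1):
        if k >= 2:
            weights[k] = term          # unnormalized P{N = k}
        term *= lam / (k + 1)
    total = sum(weights.values())
    mean = sum(k * w for k, w in weights.items()) / total
    var = sum((k - mean) ** 2 * w for k, w in weights.items()) / total
    return mean, var


def f1(lam):
    """Closed-form mean lam (e^lam - 1) / (e^lam - 1 - lam)."""
    return lam * (math.exp(lam) - 1) / (math.exp(lam) - 1 - lam)


def f2_squared(lam):
    """Closed-form variance lam [(e^lam - 1)^2 - lam^2 e^lam] / (e^lam - 1 - lam)^2."""
    e = math.exp(lam)
    return lam * ((e - 1) ** 2 - lam ** 2 * e) / (e - 1 - lam) ** 2
```

Note that the variance equals $\lambda f_1'(\lambda)$, which ties it to the derivative of $f_1$ computed in the proof of Lemma 4.1.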

Using (4.28) and writing explicitly the remaining terms in the expression 
(3.2), it is not hard to verify that 

'/(n-r) - 1^ 
l-l 



W+{Az\z) = n^' 

(4.29) 



X A'+^-(e^ - 1 - A)A^. ^A(4(r^-r- l)/ 

Px{z2,[n-T)l- Zi) 

where z[ = zi + Azi, Z2 = Z2 + Az2 and gi{z) = gi{z) / n^^~^\ 

Let $\xi = ((n-\tau)l - z_1)/z_2$. Since $(n-\tau)l \ge z_1 + 2z_2$ we have that $\xi \ge 2$, and there exists a unique nonnegative solution of $f_1(\lambda) = \xi$. Further, as long as $(z,\tau) \in Q_+(\epsilon)$ we get that $2 + (\epsilon/\rho) \le \xi \le l/\epsilon$ and hence $0 < \lambda \le l/\epsilon$ (for $\rho \le 1/\epsilon$). We show in the sequel that this implies that there exists a positive constant $C = C(l,\epsilon)$, such that, for any $\Delta z_1 \in \{-l,\ldots,l-2\}$ and $\Delta z_2 \in \{-(l-1),\ldots,0\}$,

(4.30) $\left|\frac{p_\lambda(z_2 + \Delta z_2, (n-\tau)l - z_1 - l - \Delta z_1)}{p_\lambda(z_2, (n-\tau)l - z_1)} - 1\right| \le \frac{C}{n}.$

Further, the positive term $\lambda^{l+\Delta z_1}(e^\lambda - 1 - \lambda)^{\Delta z_2}$ does not depend on $n$,
whereas elementary calculus implies that 

where < C{l)/{ne) in Q+(e). 

We turn to the asymptotic of $\tilde g_l(z)$ for $(z,\tau) \in Q_+(\epsilon)$. To this end, note that the condition $2p_0 + q_0 + q_1 + q_2 \le l$ implies that the set $\mathcal{D}$ has size bounded by a constant depending only on $l$, and that the nonnegative coefficients $c_l(q)$ of (4.27) are bounded, uniformly in $q$, by some $K = K(l) < \infty$ that is independent of $z_1$ and $z_2$ (hence independent of $n$). On $Q_+(\epsilon)$ the contribution to $\tilde g_l(z)$ of the term indexed by $q$ is at most $K n^{-(l-1)}(nl)^{p_0+q_0+q_1+q_2-1}$. As $2p_0 + q_0 + q_1 + q_2 \le l$, the sum over terms with either $p_0 > 0$ or $q_2 < l - q_0 - q_1$ is at most $C(l)/n$ for some finite $C(l)$.

Consider now $q$ with $p_0 = 0$ and $q_2 = l - q_0 - q_1$, in which case $q_1 = -\Delta z_2$ and $q_0 = -\Delta z_1 - \Delta z_2 \ge 1$ are uniquely determined by $\Delta z$. Note that $c_l(q) = \binom{q_1+q_2}{q_1}$ for these choices of $p_0$ and $q_2$, so that, with $x = n^{-1}z$,

(4.32) $\tilde g_l(z) = n^{-(q_0-1)}\frac{(z_1-1)!}{(z_1-q_0)!}\,\frac{1}{(l-1)!}\,x_2^{l-q_0}\binom{l-1}{q_0-1,\,q_1,\,q_2} + R_n,$

for some $|R_n| \le K(l,\epsilon)/n$. Since

$\left| n^{-(q_0-1)}\frac{(z_1-1)!}{(z_1-q_0)!} - x_1^{q_0-1} \right| \le \frac{C(l)}{n},$

replacing $n^{-(q_0-1)}(z_1-1)!/(z_1-q_0)!$ in (4.32) by $x_1^{q_0-1}$ and collecting together (4.29), (4.30), (4.31) and (4.32) yields the statement (4.26) of the lemma (note that $2q_1 + q_2 = l + \Delta z_1$).

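We complete the proof of the lemma by showing that (4.30) is a consequence of a local CLT for the sum $S_k$ of i.i.d. lattice random variables $X_i = (N_i - \xi)/f_2(\lambda)$. Indeed, $X_i$ have zero mean, unit variance and for some finite $c_k$ we have that $|E(X_i^k)| = |f_k(\lambda)| \le c_k$ for all $(z,\tau) \in Q_+(\epsilon)$. Further, $p_\lambda(z_2, (n-\tau)l - z_1) = P(S_k = 0)$ and $p_\lambda(z_2 + \Delta z_2, (n-\tau)l - z_1 - l - \Delta z_1) = P(S_{k'} = \eta)$ for $k = z_2$, $k' - k = \Delta z_2 \in \{-(l-1),\ldots,0\}$ and $\eta = -(l + \Delta z_1 + \xi\Delta z_2)/f_2(\lambda)$. Note that $\eta$ is uniformly bounded by some $c_1 = c_1(l,\epsilon)$ on $Q_+(\epsilon)$ and in the lattice of span $b = f_2(\lambda)^{-1}$ of possible values of $S_{k'}$. Thus, for some finite $c_2 = c_2(l,\epsilon)$, all $\eta$ and $k'$ as above, we have by Theorem 5.4 and (5.27) of [21] that

$\left| f_2(\lambda)\sqrt{k'}\,P(S_{k'} = \eta) - \phi\Big(\frac{\eta}{\sqrt{k'}}\Big) + \frac{f_3(\lambda)}{6\sqrt{k'}}\,\phi^{(3)}\Big(\frac{\eta}{\sqrt{k'}}\Big) \right| \le \frac{c_2}{k'},$

where $\phi(u) = e^{-u^2/2}/\sqrt{2\pi}$ and $\phi^{(3)}(u)$ denotes its third derivative. The same applies for $k$ and $\eta = 0$, yielding that

$\left| f_2(\lambda)\sqrt{k}\,P(S_k = 0) - \phi(0) \right| \le \frac{c_2}{k}.$

In particular, with $k \ge n\epsilon$, we see that $P(S_k = 0) \ge c_3/\sqrt{n}$ for some $c_3 > 0$ and all $n \ge n_0$, both depending only upon $l$ and $\epsilon$. As $\phi(u)$ is an even function with uniformly bounded derivatives of any order, $k, k' \ge \epsilon n$, $|\eta| \le c_1$ and $|k - k'| \le l$, it follows that for some finite $c_4 = c_4(l,\epsilon)$,

$\left| \sqrt{\frac{k}{k'}}\left[\phi\Big(\frac{\eta}{\sqrt{k'}}\Big) - \frac{f_3(\lambda)}{6\sqrt{k'}}\,\phi^{(3)}\Big(\frac{\eta}{\sqrt{k'}}\Big)\right] - \phi(0) \right| \le \frac{c_4}{n},$

from which (4.30) now directly follows. □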



We often rely on the following regularity property of $(x,\theta) \mapsto W_\theta(\cdot|x)$ for the transition kernels of (4.26).

Lemma 4.6. With $\|\cdot\|_{TV}$ denoting the total variation norm and $\|\cdot\|$ the Euclidean norm in $\mathbb{R}^2$, there exists a positive constant $L = L(l,\epsilon)$ such that for any $\theta, \theta' \in [0, 1-\epsilon]$ and $x, x' \in \mathbb{R}^2$,

(4.33) $\|W_{\theta'}(\cdot|x') - W_\theta(\cdot|x)\|_{TV} \le L(\|x' - x\| + |\theta' - \theta|).$

Proof. With $(x,\theta) \mapsto K_\theta(x)$ Lipschitz continuous, given that one finite set supports the kernels $W_\theta(\cdot|x)$ for all $(x,\theta)$ and that $W_\theta(\Delta z|x)$ of (4.26) is a smooth function of $(\mathfrak{p}_0,\mathfrak{p}_1,\mathfrak{p}_2)$ for $x \in \mathcal{K}_\theta$, we get (4.33) out of the Lipschitz continuity of $(\mathfrak{p}_0,\mathfrak{p}_1,\mathfrak{p}_2)$ on $q(\epsilon)$, proved in Lemma 4.1. □

4.4. Absence of small cores. A considerable simplification of our analysis 
comes from the observation that a typical large random hypergraph does not 
have a nonempty core of size below a certain threshold. For the convenience 
of the reader, we next adapt a result of [30] (and its proof) to the context 
of our graph ensemble. 

Lemma 4.7. A subset of v-nodes of a hypergraph is called a stopping set if the restriction of the hypergraph to this subset has no c-node of degree 1. For $l \ge 3$ and any $\epsilon > 0$ there exist $\kappa(l,\epsilon) > 0$ and $C(l,\epsilon)$ finite such that for any $m \ge \epsilon n$ the probability that a random hypergraph from the ensemble $\mathcal{G}_l(n,m)$ has a stopping set of less than $m\kappa(l,\epsilon)$ v-nodes is at most $C(l,\epsilon)m^{1-l/2}$.

Remark 4.8. Since the core is the stopping set including the maximal number of v-nodes, the lemma implies that for $m \ge \epsilon n$ the probability that a random hypergraph from the ensemble $\mathcal{G}_l(n,m)$ has a nonempty core of size less than $m\kappa(l,\epsilon)$ is at most $C(l,\epsilon)m^{1-l/2}$. With $n \le m/\epsilon$, upon changing $\kappa$ to $\kappa/\epsilon$ and increasing $C$ as needed, it further follows that the probability of having a nonempty core with less than $n\kappa$ v-nodes is at most $Cn^{1-l/2}$.
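To make the notions of core and stopping set concrete, here is a minimal sketch of leaf removal in Python. It is an illustration under simplifying assumptions, not the paper's decimation algorithm as analyzed: hyperedges (the v-nodes) are tuples of distinct vertices (the c-nodes), and the helper names `core_hyperedges` and `random_hypergraph` are hypothetical.

```python
import random
from collections import defaultdict

def core_hyperedges(hyperedges):
    """Indices of the hyperedges in the core: repeatedly delete any hyperedge
    containing a vertex that currently appears in exactly one hyperedge."""
    alive = set(range(len(hyperedges)))
    incident = defaultdict(set)  # vertex -> indices of surviving hyperedges
    for e, verts in enumerate(hyperedges):
        for v in verts:
            incident[v].add(e)
    stack = [v for v, es in incident.items() if len(es) == 1]
    while stack:
        v = stack.pop()
        if len(incident[v]) != 1:
            continue  # degree changed since v was pushed
        (e,) = incident[v]
        alive.discard(e)
        for w in hyperedges[e]:
            incident[w].discard(e)
            if len(incident[w]) == 1:
                stack.append(w)
    return alive

def random_hypergraph(n, m, l, rng):
    """n hyperedges, each a uniformly chosen set of l distinct vertices out of m.
    (Assumption: distinct vertices per hyperedge, for simplicity.)"""
    return [tuple(rng.sample(range(m), l)) for _ in range(n)]

def is_stopping_set(hyperedges, subset):
    """True if no vertex has degree exactly 1 within the given hyperedges."""
    degree = defaultdict(int)
    for e in subset:
        for v in hyperedges[e]:
            degree[v] += 1
    return all(d != 1 for d in degree.values())
```

The surviving set is the maximal stopping set, so on any input `is_stopping_set(hyperedges, core_hyperedges(hyperedges))` holds; the lemma concerns how small a nonempty such set can be.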

Proof. Let $N(s,r)$ denote the number of stopping sets in our random hypergraph which involve exactly $s$ v-nodes and $r$ c-nodes. Then, necessarily $r \le \lfloor ls/2 \rfloor$ and

$E N(s,r) = \binom{n}{s}\binom{m}{r}\,\mathrm{coeff}[(e^x-1-x)^r, x^{sl}]\,\frac{(sl)!}{m^{sl}}$

(multiply the number of sets of $s$ v-nodes and $r$ c-nodes by the probability that such a set forms a stopping set, with $\mathrm{coeff}[(e^x-1-x)^r, x^{sl}](sl)!$ counting the number of ways of connecting the $s$ v-nodes to these $r$ c-nodes so as to form a stopping set, while $m^{sl}$ is the total number of ways of connecting the $s$ v-nodes in our graph ensemble). It is easy to see that for any integers $r,t \ge 1$,

$\mathrm{coeff}[(e^x-1-x)^r, x^t] \le (e^x-1-x)^r\big|_{x=1} \le 1.$

Hence, for some $C = C(l,\epsilon)$ finite, any $m \ge \epsilon n$, $sl \le m$ and $r \le \lfloor ls/2 \rfloor$,

$E N(s,r) \le \binom{n}{s}\binom{m}{r}\frac{(sl)!}{m^{sl}} \le \frac{n^s}{s!}\,\frac{m^{\lfloor sl/2\rfloor}}{\lfloor sl/2\rfloor!}\,\frac{(sl)!}{m^{sl}} \le \Big[C\Big(\frac{s}{m}\Big)^{l/2-1}\Big]^s.$

Thus, fixing $0 < \kappa < 1/l$ (so $sl \le m$ whenever $s \le \kappa m$), for $l \ge 3$, the probability that a random hypergraph from the ensemble $\mathcal{G}_l(n,m)$ has a stopping set of size at most $m\kappa$ is bounded above by

$\sum_{s=1}^{m\kappa}\sum_{r=1}^{\lfloor ls/2\rfloor} E N(s,r) \le C'(l,\epsilon)\,m^{1-l/2},$

provided $C\kappa^{l/2-1} < 1/2$. □

5. Auxiliary processes and proof of Proposition 2.1. In this section we provide relations between two auxiliary inhomogeneous $\mathbb{Z}^2$-valued Markov processes whose distributions are denoted, respectively, as $\hat P_{n,\rho}(\cdot)$ and $P_{n,\rho}(\cdot)$. In both cases, we denote the process as $\{z(\tau) = (z_1(\tau), z_2(\tau)),\ 0 \le \tau \le n\}$, and use for both the same initial condition

$\hat P_{n,\rho}(z(0) = z) = P_{n,\rho}(z(0) = z) = P_{\mathcal{G}_l(n,m)}(z(G) = z)$

if $z \in \mathbb{Z}_+^2$ is such that $z_1 + 2z_2 \le nl$, and $\hat P_{n,\rho}(z(0) = z) = P_{n,\rho}(z(0) = z) = 0$ otherwise. Here $P_{\mathcal{G}_l(n,m)}(\cdot)$ is the uniform distribution on the graph ensemble $\mathcal{G}_l(n,m)$ and $m = \lfloor n\rho \rfloor$.

Turning to specify the transition kernels, recall the triangles $\mathcal{K}_\theta = \{x \in \mathbb{R}_+^2 : x_1 + 2x_2 \le l(1-\theta)\}$, $\theta \in [0,1)$, and set

$\hat W_\tau(\Delta z|z) = \begin{cases} W_\tau^+(\Delta z|z), & \text{if } z_1 \ge 1,\ n^{-1}z \in \mathcal{K}_{\tau/n}, \\ W_{\tau/n}(\Delta z|n^{-1}z), & \text{otherwise,} \end{cases}$

for $W_\tau^+$ of (3.2) and the simpler kernel $W_\theta(\cdot|\cdot)$ of (4.26). The transition probabilities are then

(5.1) $\hat P_{n,\rho}(z(\tau+1) = z + \Delta z \,|\, z(\tau) = z) = \hat W_\tau(\Delta z|z),$

(5.2) $P_{n,\rho}(z(\tau+1) = z + \Delta z \,|\, z(\tau) = z) = W_{\tau/n}(\Delta z|n^{-1}z),$

for $\tau = 0,1,\ldots,n-1$. While the Markov process of Lemma 3.1 describing the evolution under the decimation algorithm has $n^{-1}z(\tau) \in \mathcal{K}_{\tau/n}$, this is not necessarily the case for the two auxiliary processes we consider here. Nevertheless, the Markov process of Lemma 3.1 coincides with the one associated with $\hat P_{n,\rho}(\cdot)$ up to the first time $\tau$ at which $z_1(\tau) = 0$, that is, when the decimation algorithm terminates at the core of the hypergraph.

We next provide a coupling that keeps the process of distribution $\hat P_{n,\rho}(\cdot)$ "very close" to its "approximation" by the process of distribution $P_{n,\rho}(\cdot)$ as long as the former belongs to $Q(\eta)$ for some $\eta > 0$. We shall see in Corollary 5.4 that up to an exponentially small probability (as $n \to \infty$), this is indeed the case for $\tau \le (1-\epsilon)n$, allowing us to focus on the properties of the simpler distribution $P_{n,\rho}(\cdot)$.

Lemma 5.1. There exist finite $C_* = C_*(l,\epsilon)$ and positive $\lambda_* = \lambda_*(l,\epsilon)$, and a coupling between $\{z(\tau)\} \stackrel{d}{=} \hat P_{n,\rho}(\cdot)$ and $\{z'(\tau)\} \stackrel{d}{=} P_{n,\rho}(\cdot)$, such that for any $n$, $\rho \in [\epsilon, 1/\epsilon]$ and $r > 0$,

(5.3) $P\Big\{\sup_{\tau \le \tau_*} \|z(\tau) - z'(\tau)\| > r\Big\} \le C_* e^{-\lambda_* r},$

where $\tau_* \le n$ denotes the first time such that $(z(\tau_*), \tau_*) \notin Q(\epsilon)$.

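Proof. To construct the coupling between the two processes, start with $z'(0) = z(0)$, which is possible since $z(0)$ and $z'(0)$ are identically distributed. Then, for $\tau = 0, 1, \ldots, n-1$, with $z(\tau) = z$ and $z'(\tau) = z'$, set $z(\tau+1) = z + \Delta z$ and $z'(\tau+1) = z' + \Delta z'$, where the joint distribution (coupling) of $(\Delta z, \Delta z')$ is chosen such that

(5.4) $P(\Delta z \ne \Delta z' \,|\, z, z') = \|\hat W_\tau(\cdot|z) - W_{\tau/n}(\cdot|n^{-1}z')\|_{TV}.$

Clearly, it suffices to show that $\Lambda_n(\lambda_*) \le C_*$ for some $\lambda_* > 0$ and $C_* < \infty$ that depend only on $l$ and $\epsilon$, where

$Z(\tau) \equiv \sup_{\sigma \le \tau\wedge\tau_*} \|z(\sigma) - z'(\sigma)\|, \qquad \Lambda_\tau(\lambda) \equiv E[e^{\lambda Z(\tau)}],$

for $\tau \in \{0,\ldots,n\}$ and $\lambda > 0$. To this end, note first that by our definition of $\hat W_\tau(\cdot|z)$ we have from Lemma 4.5 that for some finite $c = c(l,\epsilon)$, any $(z,\tau) \in Q(\epsilon)$, and all $n$,

(5.5) $\|\hat W_\tau(\cdot|z) - W_{\tau/n}(\cdot|n^{-1}z)\|_{TV} \le \frac{c}{n}$

[since the kernels $\hat W_\tau(\cdot|z)$ and $W_{\tau/n}(\cdot|n^{-1}z)$ are nonzero for at most $2l^2$ points]. Further, with $\|\Delta z\| \le 2l$ and $\|\Delta z'\| \le 2l$, we have that for any $0 < \lambda \le 1/(4l)$ (so $e^{4l\lambda} \le 1 + 8l\lambda$), $\sigma = 0, 1, \ldots, n-1$ and realizations of the two processes,

$e^{\lambda Z(\sigma+1)} \le e^{\lambda Z(\sigma)}\big[1 + 8l\lambda\,\mathbf{1}_{\{\Delta z(\sigma) \ne \Delta z'(\sigma)\}}\mathbf{1}_{\{\sigma < \tau_*\}}\big].$

As $\tau_*$ is a stopping time and our coupling satisfies (5.4), upon considering the expectation of the preceding inequality we get that

(5.6) $\Lambda_{\sigma+1}(\lambda) \le \Lambda_\sigma(\lambda) + 8l\lambda\,E\{\|\hat W_\sigma(\cdot|z(\sigma)) - W_{\sigma/n}(\cdot|n^{-1}z'(\sigma))\|_{TV}\,\mathbf{1}_{\{\sigma<\tau_*\}}\,e^{\lambda Z(\sigma)}\}.$

Recall that as long as $(z(\sigma),\sigma) \in Q(\epsilon)$, by (5.5) and Lemma 4.6

(5.7) $\|\hat W_\sigma(\cdot|z(\sigma)) - W_{\sigma/n}(\cdot|n^{-1}z'(\sigma))\|_{TV} \le \frac{c}{n} + \frac{L}{n}\|z(\sigma) - z'(\sigma)\|.$

Since $\|z(\sigma) - z'(\sigma)\|\mathbf{1}_{\{\sigma<\tau_*\}} \le Z(\sigma)$, combining the bounds of (5.6) and (5.7), we deduce that

(5.8) $\Lambda_{\sigma+1}(\lambda) \le [1 + 8lcn^{-1}\lambda]\,E\{(1 + n^{-1}8lL\lambda Z(\sigma))e^{\lambda Z(\sigma)}\} \le [1 + 8lcn^{-1}\lambda]\,\Lambda_\sigma(\lambda(1 + 8lLn^{-1})).$

Since $\Lambda_0(\lambda) = 1$, taking $\lambda_* = \exp(-8lL)/(4l) < 1/(4l)$, and applying the inequality (5.8) for the monotone increasing sequence $\{\lambda_\sigma, \sigma \ge 0\}$ with $\lambda_0 = \lambda_*$ and $\lambda_{\sigma+1} = \lambda_\sigma(1 + 8lLn^{-1})$, such that $\lambda_n = \lambda_*(1 + 8lL/n)^n \le 1/(4l)$, we get that

$\Lambda_n(\lambda_*) \le \prod_{\sigma=0}^{n-1}(1 + 8lcn^{-1}\lambda_\sigma) \le \exp\Big\{8lcn^{-1}\sum_{\sigma=0}^{n-1}\lambda_\sigma\Big\} \le \exp\{8lc\lambda_n\} \le \exp\{2c\},$

completing the proof of the lemma. □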
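The one-step coupling used in (5.4) is the standard maximal coupling of two laws on a finite set, under which the two draws differ with probability exactly equal to the total variation distance. A minimal sketch follows, assuming the convention $\|p - q\|_{TV} = \frac12\sum_k |p_k - q_k|$ and representing each kernel row as a dictionary from increments to probabilities; the function names are illustrative.

```python
def total_variation(p, q):
    """Half the L1 distance between two finitely supported laws."""
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)

def maximal_coupling(p, q):
    """Joint law {(x, y): prob} with marginals p and q and P(X != Y) = TV(p, q)."""
    keys = sorted(set(p) | set(q))
    overlap = {k: min(p.get(k, 0.0), q.get(k, 0.0)) for k in keys}
    tv = 1.0 - sum(overlap.values())
    # Put as much mass as possible on the diagonal.
    joint = {(k, k): w for k, w in overlap.items() if w > 0.0}
    if tv > 1e-15:
        # The residual laws have disjoint supports, so x != y below.
        res_p = {k: p.get(k, 0.0) - overlap[k] for k in keys}
        res_q = {k: q.get(k, 0.0) - overlap[k] for k in keys}
        for x, a in res_p.items():
            if a <= 0.0:
                continue
            for y, b in res_q.items():
                if b > 0.0:
                    joint[(x, y)] = joint.get((x, y), 0.0) + a * b / tv
    return joint

def mismatch_probability(joint):
    """P(X != Y) under a joint law."""
    return sum(w for (x, y), w in joint.items() if x != y)
```

For instance, two kernels on increments `(-1, 0)`, `(0, 0)`, `(1, -1)` with weights `(0.5, 0.3, 0.2)` and `(0.2, 0.3, 0.5)` are coupled so that the increments disagree with probability 0.3, which is their total variation distance.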

We turn to establish some of the asymptotic (in $n \to \infty$) properties of our approximating processes [of distribution $P_{n,\rho}(\cdot)$].

Lemma 5.2. For any $l \ge 3$ and $\epsilon > 0$ there exist positive, finite constants $\eta \le \epsilon$, and $C_0, C_1, C_2, C_3$, such that, for any $n$, $\rho \in [\epsilon, 1/\epsilon]$ and $\tau \in \{0,\ldots,\lfloor n(1-\epsilon)\rfloor\}$,

(a) $z(\tau)$ is exponentially concentrated around its mean

(5.9) $P_{n,\rho}\{\|z(\tau) - E z(\tau)\| \ge r\} \le 4e^{-r^2/(C_0 n)}.$

(b) $E z(\tau)$ is close to the solution of the ODE (2.5),

(5.10) $\|E z(\tau) - n y(\tau/n)\| \le C_1\sqrt{n\log n}.$

(c) $(z(\tau),\tau) \in Q(\eta)$ with high probability; more precisely,

(5.11) $P_{n,\rho}\{(z(\tau),\tau) \notin Q(\eta)\} \le C_2 e^{-C_3 n}.$

Proof. (a) For $\tau = 0$, upon taking $C_0$ large enough, this is an immediate consequence of (4.19). Turning to the general case, applying the Azuma-Hoeffding inequality for Doob's martingale

$Z(\sigma) = E[z(\tau) \,|\, z(0), \ldots, z(\sigma)], \qquad \sigma \in \{0,\ldots,\tau\},$

we see that for some $c_0 = c_0(\epsilon)$ finite, any $n$, $r > 0$, $\tau = 1,\ldots,n(1-\epsilon)$ and $\rho \in [\epsilon, 1/\epsilon]$,

(5.12) $P_{n,\rho}\{\|z(\tau) - E[z(\tau)|z(0)]\| \ge r\} \le 4\exp(-r^2/(2c_0^2\tau)),$

provided $\mathrm{ess\,sup}\,\|Z(\sigma) - Z(\sigma-1)\| \le c_0$ for all $1 \le \sigma \le \tau$. To this end, with $z(\cdot)$ a Markov process, we have the bound

(5.13) $\mathrm{ess\,sup}\,\|Z(\sigma) - Z(\sigma-1)\| \le \sup \|E[z(\tau)|z(\sigma) = z^{(1)}] - E[z(\tau)|z(\sigma) = z^{(2)}]\|,$

where the preceding supremum is over all $z^{(1)}, z^{(2)}$ such that some trajectories $\{z(0), \ldots, z(\sigma-1), z(\sigma) = z^{(1)}\}$ and $\{z(0), \ldots, z(\sigma-1), z(\sigma) = z^{(2)}\}$ are both of positive probability. In particular, $\|z^{(1)} - z^{(2)}\| \le 4l$. Fixing such $z^{(1)}$ and $z^{(2)}$, let $z^{(1)}(\nu)$ and $z^{(2)}(\nu)$ denote the realizations of two Markov processes of same transition kernels $W_{\nu/n}(\cdot|\cdot)$, starting at $z^{(1)}(\sigma) = z^{(1)}$ and $z^{(2)}(\sigma) = z^{(2)}$, respectively, where for $\nu = \sigma, \ldots, \tau-1$ the joint distribution (coupling) of $\Delta z^{(1)}(\nu) = z^{(1)}(\nu+1) - z^{(1)}(\nu)$ and $\Delta z^{(2)}(\nu) = z^{(2)}(\nu+1) - z^{(2)}(\nu)$ is chosen such that

$P(\Delta z^{(1)}(\nu) \ne \Delta z^{(2)}(\nu) \,|\, z^{(1)}(\nu), z^{(2)}(\nu)) = \|W_{\nu/n}(\cdot|n^{-1}z^{(1)}(\nu)) - W_{\nu/n}(\cdot|n^{-1}z^{(2)}(\nu))\|_{TV}.$

With $\Delta(\nu) = E\|z^{(1)}(\nu) - z^{(2)}(\nu)\|$, the right-hand side of (5.13) is upper-bounded by the supremum of $\Delta(\tau)$ over all possible pairs of initial conditions such that $\Delta(\sigma) = \|z^{(1)} - z^{(2)}\| \le 4l$. Further, due to the Markov property of $z$ and the preceding coupling, for $\sigma \le \nu < \tau$ we have by (4.33) that

$\Delta(\nu+1) \le E\|z^{(1)}(\nu) - z^{(2)}(\nu)\| + E\{E[\|\Delta z^{(1)}(\nu) - \Delta z^{(2)}(\nu)\| \,|\, z^{(1)}(\nu), z^{(2)}(\nu)]\}$

$\le \Delta(\nu) + 4l\,E\{\|W_{\nu/n}(\cdot|n^{-1}z^{(1)}(\nu)) - W_{\nu/n}(\cdot|n^{-1}z^{(2)}(\nu))\|_{TV}\}$

$\le \Big(1 + \frac{4lL}{n}\Big)\Delta(\nu).$

With $\tau \le n$, it thus follows that $\Delta(\tau) \le \exp(4lL)\Delta(\sigma) \le 4l\exp(4lL) =: c_0$, as claimed.

Further, the preceding argument shows that $\psi(z) = E[z(\tau)|z(0) = z]$ is a uniformly Lipschitz continuous function of $z$, of Lipschitz constant $\|\psi\|_L = \exp(4lL)$ that is independent of $\tau$, $n$ and $\rho$. Hence, from (4.19) we have that

$P\{\|\psi(z(0)) - \psi(E z(0))\| \ge r\|\psi\|_L\} \le P\{\|z(0) - E z(0)\| \ge r\} \le 4e^{-r^2/(C_0 n)}.$

Integrating this over $r \ge 0$, we have that $\|E\psi(z(0)) - \psi(E z(0))\| \le c\sqrt{n}$ for some finite constant $c$ depending only on $\epsilon$ and $l$, yielding that

$P_{n,\rho}\{\|E[z(\tau)|z(0)] - E[z(\tau)]\| \ge r\} = P\{\|\psi(z(0)) - E\psi(z(0))\| \ge r\} \le c_1' e^{-r^2/(c_2' n)},$

for some $c_1'$ and $c_2'$ which depend only on $\epsilon$ and $l$, which, together with (5.12), concludes the proof of (5.9).

(b) Since $\|z(\tau)\| \le 2nl$, choosing $r = \sqrt{C_0 n\log n}$ in (5.9) we find that

(5.14) $E\|z(\tau) - E z(\tau)\| \le c_1\sqrt{n\log n},$

for some finite $c_1(\epsilon)$. Denote by $\Delta_n(\tau) = \|E z(\tau) - n y(\tau/n)\|$ the error made in replacing the expectation of the process $z(\tau)$ of distribution $P_{n,\rho}(\cdot)$ with the (rescaled) solution of the ODE. Then, fixing $\tau < n(1-\epsilon)$, we have by the Markov property of $z(\cdot)$ that

$\Delta_n(\tau+1) = \|E z(\tau) - n y(\tau/n) + E\{E[\Delta z(\tau)|z(\tau)]\} - n[y(\tau/n + 1/n) - y(\tau/n)]\|.$

Recall that for $\theta \le 1-\epsilon$,

$\frac{dy}{d\theta} = F(y,\theta) = \sum_{\Delta z} W_\theta(\Delta z|y)\,\Delta z,$

so by the triangle inequality we get that $\Delta_n(\tau+1) \le \Delta_n(\tau) + \delta_n^{(1)}(\tau) + \delta_n^{(2)}(\tau) + \delta_n^{(3)}(\tau)$, where

$\delta_n^{(1)}(\tau) = \Big\|E\sum_{\Delta z}[W_{\tau/n}(\Delta z|n^{-1}z(\tau)) - W_{\tau/n}(\Delta z|n^{-1}E z(\tau))]\Delta z\Big\|,$

$\delta_n^{(2)}(\tau) = \Big\|\sum_{\Delta z}[W_{\tau/n}(\Delta z|n^{-1}E z(\tau)) - W_{\tau/n}(\Delta z|y(\tau/n))]\Delta z\Big\|,$

$\delta_n^{(3)}(\tau) = n\Big\|\int_{\tau/n}^{\tau/n+1/n}[F(y(\theta),\theta) - F(y(\tau/n),\tau/n)]\,d\theta\Big\|.$

Recall as in (4.11) that $\delta_n^{(3)}(\tau) \le c_3 n^{-1}$. Since $\|\Delta z\| \le 4l$, we have by (4.33) that

$\delta_n^{(2)}(\tau) \le 4l\,\|W_{\tau/n}(\cdot|n^{-1}E z(\tau)) - W_{\tau/n}(\cdot|y(\tau/n))\|_{TV} \le \frac{4lL}{n}\|E z(\tau) - n y(\tau/n)\| = \frac{4lL}{n}\Delta_n(\tau).$

Similarly, by (4.33) and (5.14), for some $c_2 = c_2(\epsilon)$ finite,

$\delta_n^{(1)}(\tau) \le \frac{4lL}{n}E\|z(\tau) - E z(\tau)\| \le c_2\sqrt{\frac{\log n}{n}},$

so putting these estimates together, we obtain the inequality

$\Delta_n(\tau+1) \le \Big(1 + \frac{4lL}{n}\Big)\Delta_n(\tau) + c_3\sqrt{\frac{\log n}{n}}.$

Further, recall (4.18) of Lemma 4.4 that $\Delta_n(0)$ is bounded in $n$ and $m = \lfloor\rho n\rfloor$, provided $\rho \in [\epsilon, 1/\epsilon]$. Thus, we easily get (5.10) upon applying the preceding recursion for $\tau = 0, \ldots, n-1$.
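The last step is a discrete Gronwall argument: a recursion of the form $\Delta(\tau+1) \le (1 + a/n)\Delta(\tau) + b\sqrt{\log n/n}$, iterated at most $n$ times, gives $\Delta(\tau) \le e^{a}(\Delta(0) + b\sqrt{n\log n})$. The sketch below iterates the worst case (equality at every step) and compares it with that closed-form bound; the constants `a`, `b`, `delta0` are arbitrary placeholders standing in for $4lL$, $c_3$ and $\Delta_n(0)$.

```python
import math

def iterate_error_recursion(n, a, b, delta0):
    """Worst case of Delta(t+1) = (1 + a/n) Delta(t) + b sqrt(log n / n), t = 0..n-1."""
    delta = delta0
    step = b * math.sqrt(math.log(n) / n)
    for _ in range(n):
        delta = (1.0 + a / n) * delta + step
    return delta

def gronwall_bound(n, a, b, delta0):
    """Closed-form bound e^a (Delta(0) + b sqrt(n log n)) after n steps."""
    return math.exp(a) * (delta0 + b * math.sqrt(n * math.log(n)))
```

The bound follows because each of the $n$ additive terms is amplified by at most $(1 + a/n)^n \le e^a$, so the accumulated error is of order $\sqrt{n\log n}$, as in (5.10).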

(c) In the course of proving part (a) of Proposition 4.2 we have seen that there exists $\eta = \eta(\epsilon,l) > 0$ such that if $\rho \in [\epsilon, 1/\epsilon]$ and $\theta \le (1-\epsilon)$, then $y_1(\theta) \ge -l + 2\eta$, $y_2(\theta) \ge 2\eta$ and $(1-\theta)l - \max(y_1(\theta),0) - 2y_2(\theta) \ge 2\eta$. Consequently, taking $\eta \le \epsilon$, for such $\rho$ and $\tau \in \{0,\ldots,\lfloor n(1-\epsilon)\rfloor\}$, if $\|z(\tau) - n y(\tau/n)\| \le n\eta/3$, then clearly $(z(\tau),\tau) \in Q(\eta)$. We thus get (5.11) upon considering (5.9) and (5.10) for $r = n\eta/6$ and $n$ such that $C_1\sqrt{n\log n} \le n\eta/6$. □

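The first consequence of Lemma 5.2 is the existence of a "critical time window." That is, for $\rho$ near $\rho_c$ a typical trajectory $\{z(\tau); 0 \le \tau \le (1-\epsilon)n\}$ does not traverse the $z_1 = 0$ plane if $\tau$ is not near $n\theta_c$.

Corollary 5.3. Fixing $\beta \in (3/4, 1)$, $\beta' < 2\beta - 1$ and $\epsilon > 0$ let $I_n = [0, n\theta_c - n^\beta] \cup [n\theta_c + n^\beta, n(1-\epsilon)]$. Then, for some $C_4$ finite, $\eta$ positive, all $n$ and $|\rho - \rho_c| \le n^{\beta'-1}$,

$P_{n,\rho}\Big\{\min_{\tau\in I_n} z_1(\tau) \le n^\eta\Big\} \le C_4 e^{-n^\eta}.$

Proof. From part (d) of Proposition 4.2, we have that $n y_1(\tau/n, \rho_c) \ge cn^{2\beta-1}$ for some $c > 0$, all $n$ and $\tau \in I_n$. Since $\rho \mapsto y(\theta,\rho)$ is Lipschitz continuous [by Proposition 4.2, part (a)], there exists a finite constant $c'$ such that $\|y(\theta,\rho) - y(\theta,\rho_c)\| \le c'n^{\beta'-1}$ for any $\theta \in [0, 1-\epsilon]$ and $|\rho - \rho_c| \le n^{\beta'-1}$. By part (b) of Lemma 5.2, we thus get that for $\beta' < 2\beta - 1$, $\beta > 3/4$, some positive $C = C(\beta,\beta')$ and all $n$ large enough, if $\tau \in I_n$ and $|\rho - \rho_c| \le n^{\beta'-1}$, then

$E z_1(\tau) \ge n y_1(\tau/n,\rho) - C_1\sqrt{n\log n} \ge n y_1(\tau/n,\rho_c) - c'n^{\beta'} - C_1\sqrt{n\log n} \ge 2Cn^{2\beta-1}.$

Applying now Lemma 5.2, part (a), we see that for any $\eta < (4\beta-3)/2$, some $C' = C'(\beta,\beta',\eta)$ finite and all $n$ large enough,

$P_{n,\rho}\{z_1(\tau) \le n^\eta\} \le P_{n,\rho}\{\|z(\tau) - E z(\tau)\| \ge Cn^{2\beta-1}\} \le C' n^{-1} e^{-n^\eta},$

and the stated bound follows by a union over the at most $n$ values of $\tau \in I_n$. □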
The second consequence of Lemma 5.2 is that with high probability also the process $\{z(\tau)\}$ of distribution $\hat P_{n,\rho}(\cdot)$ belongs to the set $Q(\eta)$ as long as $\tau/n$ is bounded away from 1.

Corollary 5.4. For any $\epsilon > 0$, there exists $\eta > 0$ and positive, finite constants $C_5, C_6$ such that if $\rho \in [\epsilon, 1/\epsilon]$, then

(5.15) $\hat P_{n,\rho}\{(z(\tau),\tau) \in Q(\eta)\ \forall\, 0 \le \tau \le n(1-\epsilon)\} \ge 1 - C_5 e^{-C_6 n}.$

Proof. From part (c) of Lemma 5.2 we have that for some $\eta' \in (0,\epsilon)$, positive and finite $c_5$ and $c_6$,

(5.16) $P_{n,\rho}\{(z(\tau),\tau) \in Q(\eta')\ \forall\, 0 \le \tau \le n(1-\epsilon)\} \ge 1 - c_5 e^{-c_6 n}.$

Applying the coupling of Lemma 5.1 with $\eta = \eta'/4$ for the value of $\epsilon$ in the statement of this lemma, we also have that

(5.17) $P\Big\{\sup_{\tau \le \tau_f}\|z(\tau) - z'(\tau)\| > \eta n\Big\} \le c_7 e^{-c_8 n},$

where $\tau_f \le n$ denotes the first time such that $(z(\tau),\tau) \notin Q(\eta)$. Further, if $\tau_f \le n(1-\epsilon)$ and $\sup_{\tau\le\tau_f}\|z(\tau) - z'(\tau)\| \le \eta n$, then necessarily $(z'(\tau),\tau) \notin Q(4\eta) = Q(\eta')$ for $\tau = \tau_f \le n(1-\epsilon)$, an event whose probability is at most $c_5 e^{-c_6 n}$ [by (5.16)]. Combining the latter bound with (5.17) we find that

$\hat P_{n,\rho}\{\tau_f \le n(1-\epsilon)\} \le c_5 e^{-c_6 n} + c_7 e^{-c_8 n},$

yielding (5.15) for $C_5 = c_5 + c_7$ and $C_6 = \min(c_6, c_8)$, both finite and positive. □



Proof of Proposition 2.1. For $\{z(\tau); \tau \ge 0\}$ distributed according to $\hat P_{n,\rho}(\cdot)$ let $\tau_{**}$ denote the first time at which $z_1(\tau) \le 0$. Since by construction (and using Lemma 3.1), the sequence $\{z_1(\tau); 0 \le \tau \le \tau_{**}\}$ is distributed as the number of c-nodes of the graphs $G(\tau)$ having degree 1 under the decimation algorithm, we see that $P_l(n,\rho) = \hat P_{n,\rho}\{\tau_{**} \le n-1\}$.

Further, the core of the initial graph $G(0)$ includes at most $n - \tau_{**}$ vertices. Consequently, by Lemma 4.7 (cf. Remark 4.8), we can choose $D < \infty$ and $0 < \kappa < 1 - \theta_c$ such that

$\hat P_{n,\rho}\{\tau_{**} \le n(1-\kappa)\} \le P_l(n,\rho) \le \hat P_{n,\rho}\{\tau_{**} \le n(1-\kappa)\} + \tfrac14\delta_n,$

for $|\rho - \rho_c| \le n^{\beta'-1}$ and $\delta_n = Dn^{-1/2}(\log n)^2$.

By Corollary 5.4, there exist $0 < \eta < \epsilon < \kappa$ and finite, positive $C_5, C_6$, such that $\{(z(\tau),\tau),\ 0 \le \tau \le n(1-\epsilon)\} \subset Q(\eta)$ with probability at least $1 - C_5 e^{-C_6 n}$ for all $n$. Hence, we have that

$\hat P_{n,\rho}\Big\{\min_{0\le\tau\le\tau_*} z_1(\tau) \le 0\Big\} \le P_l(n,\rho) \le \hat P_{n,\rho}\Big\{\min_{0\le\tau\le\tau_*} z_1(\tau) \le 0\Big\} + C_5 e^{-C_6 n} + \tfrac14\delta_n,$

for

$\tau_* = n(1-\kappa) \wedge \min\{\tau : (z(\tau),\tau) \notin Q(\eta)\}.$

By Lemma 5.1, there exist $A > 0$, and a coupling of the process $\{z(\tau)\}$ with a process $\{z'(\tau)\}$ of distribution $P_{n,\rho}(\cdot)$, such that, with probability larger than $1 - 1/(2n)$, up to time $\tau_*$ the distance between these two processes is at most $\epsilon_n = A\log n$. Therefore, enlarging $D$ if necessary, we have that

$P\Big\{\min_{0\le\tau\le\tau_*} z_1'(\tau) \le -\epsilon_n\Big\} - \frac{1}{2n} \le P_l(n,\rho) \le P\Big\{\min_{0\le\tau\le\tau_*} z_1'(\tau) \le \epsilon_n\Big\} + \frac{1}{2n} + \tfrac12\delta_n.$

We have seen that $\tau_* < n(1-\kappa)$ with probability of at most $C_5 e^{-C_6 n}$. Hence, enlarging $D$ once more, we find that

$P_{n,\rho}\Big\{\min_{0\le\tau\le n(1-\kappa)} z_1(\tau) \le -\epsilon_n\Big\} - \tfrac12\delta_n \le P_l(n,\rho) \le P_{n,\rho}\Big\{\min_{0\le\tau\le n(1-\kappa)} z_1(\tau) \le \epsilon_n\Big\} + \delta_n.$

With $\theta_c < (1-\kappa)$, the set $[0, n(1-\kappa)]$ is the disjoint union of $J_n$ as in the statement of the proposition and a set of the form $I_n$ of Corollary 5.3. Thus, bounding the probability of the event $\min_{\tau\in I_n} z_1(\tau) \le \epsilon_n$ via the latter corollary yields the thesis of the proposition [enlarging $D$ as needed for absorbing the term $C_4\exp(-n^\eta)$ into $\delta_n$]. □

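Let $\tilde P_{n,\rho}(\cdot)$ denote the law of the $\mathbb{R}^2$-valued Markov chain $\{z'(\tau)\}$ of (2.12), where $z'(0)$ has the uniform distribution $P_{\mathcal{G}_l(n,m)}(\cdot)$ on the graph ensemble $\mathcal{G}_l(n,m)$ for $m = \lfloor n\rho\rfloor$, and

$\tilde P_{n,\rho}\big(z'(\tau+1) = z'(\tau) + \Delta_\tau + \tilde A_\tau(n^{-1}z'(\tau) - y(\tau/n)) \,\big|\, z'(\tau) = z'\big) = W_{\tau/n}(\Delta_\tau|y(\tau/n)).$

We conclude this section by providing a coupling that keeps the process $\{z'(\cdot)\}$ "sufficiently close" to $\{z(\cdot)\}$ of distribution $P_{n,\rho}(\cdot)$ throughout the time interval $J_n$ of interest to us.

Proposition 5.5. Fixing $\beta \in (3/4, 1)$ and $\beta' < 2\beta - 1$, for any $\delta > \beta - 1/2$ there exist finite constants $\alpha$, $c$ and a coupling of the processes $\{z(\cdot)\}$ of distribution $P_{n,\rho}(\cdot)$ and $\{z'(\cdot)\}$ of distribution $\tilde P_{n,\rho}(\cdot)$ such that for all $n$ and $|\rho - \rho_c| \le n^{\beta'-1}$,

(5.18) $P\Big\{\sup_{\tau\in J_n}\|z(\tau) - z'(\tau)\| > cn^\delta\Big\} \le \frac{\alpha}{n}.$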



The key to Proposition 5.5 is the following elementary martingale concentration property.

Lemma 5.6. Consider an $\mathbb{R}^d$-valued discrete-time martingale $(Z_s, \mathcal{F}_s)$ with $Z_0 = 0$ and $U_s = Z_{s+1} - Z_s$ such that for some finite $\Gamma$ and a stopping time $\tau_*$ for $\mathcal{F}_s$

(5.19) $E[\|U_s\|^2 e^{\lambda\cdot U_s} \,|\, \mathcal{F}_s] \le \Gamma\,E[e^{\lambda\cdot U_s} \,|\, \mathcal{F}_s] < \infty$ whenever $s < \tau_*$, $\|\lambda\| \le 1$.

Then, for any $0 < a \le t\Gamma\sqrt{d}$,

$P\{\|Z_{\min(t,\tau_*)}\| \ge a\} \le 2d\exp\Big\{-\frac{a^2}{2d\Gamma t}\Big\}.$

Proof. Recall that for a real-valued variable $V$, if $E[V] = 0$ and $E[V^2\exp(uV)] \le \kappa E[\exp(uV)] < \infty$ for all $u \in [0,1]$, then $E[\exp(V)] \le \exp(\kappa/2)$ (bound the value of $\phi(1)$ for $\phi(u) = \log E[\exp(uV)]$ using $\phi(0) = \phi'(0) = 0$ and $\phi''(u) \le \kappa$). In the special case of $d = 1$ and $\tau_* = \infty$, we have from (5.19) that the preceding assumptions hold for $\kappa = \Gamma\lambda^2$, $|\lambda| \le 1$ and $V$ having the law of $\lambda U_s$ conditional on $\mathcal{F}_s$. Consequently, then $E[\exp(\lambda U_s)|\mathcal{F}_s] \le \exp(\Gamma\lambda^2/2)$, implying that $E[M_t] \le E[M_0] = 1$ for the supermartingale $M_s = \exp(\lambda Z_s - \Gamma\lambda^2 s/2)$. Considering $a \in (0, t\Gamma]$ and $\lambda = a/(\Gamma t)$, we thus deduce that $P\{Z_t \ge a\} \le \exp\{-a^2/(2\Gamma t)\}$ in case $Z_s$ is a real-valued martingale for $\mathcal{F}_s$ and (5.19) holds for all $s < \infty$. The stated bound for an $\mathbb{R}^d$-valued martingale $Z_s$ of coordinates $Z_{s,i}$ follows upon noting that the event $\{\|Z_t\| \ge a\}$ is contained in the union of the events $\{uZ_{t,i} \ge a/\sqrt{d}\}$ for $u = -1, 1$ and $i = 1,\ldots,d$, with $uZ_{s,i}$ real-valued martingales. Finally, we get the thesis in the general case, where $P(\tau_* < \infty) > 0$, upon considering the (stopped) martingale $Z_{\min(s,\tau_*)}$. □
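As a sanity check of the one-dimensional bound, consider the simplest example, which is an assumption made here for illustration and not taken from the paper: a simple symmetric random walk with steps $\pm 1$. Then $\|U_s\|^2 = 1$, so (5.19) holds with $\Gamma = 1$, and the lemma predicts $P\{Z_t \ge a\} \le \exp(-a^2/(2t))$ for $0 < a \le t$. The sketch below computes the exact tail from binomial coefficients and compares it with this bound.

```python
import math

def upper_tail_simple_walk(t, a):
    """Exact P(Z_t >= a) for a simple symmetric +-1 random walk started at 0."""
    favorable = 0
    for ups in range(t + 1):  # 'ups' up-steps give Z_t = 2*ups - t
        if 2 * ups - t >= a:
            favorable += math.comb(t, ups)
    return favorable / 2 ** t

def lemma_bound(t, a, gamma=1.0):
    """One-dimensional bound exp(-a^2 / (2 Gamma t)), valid for 0 < a <= t Gamma."""
    return math.exp(-a * a / (2.0 * gamma * t))
```

In this example the bound coincides with the classical Hoeffding estimate; the point of the lemma is that $\Gamma$ may be much smaller than the squared sup-norm of the increments when large increments are rare, which is how it is used in the proof of Proposition 5.5.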

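Proof of Proposition 5.5. We couple the processes $\{z(\cdot)\} \stackrel{d}{=} P_{n,\rho}(\cdot)$ and $\{z'(\cdot)\} \stackrel{d}{=} \tilde P_{n,\rho}(\cdot)$ in a joint Markov process, by letting $z(0) = z'(0)$ and for $\tau = 0, 1, 2, \ldots, n-1$,

$P(\Delta z(\tau) \ne \Delta_\tau \,|\, \mathcal{F}_\tau) = \|W_{\tau/n}(\cdot|n^{-1}z(\tau)) - W_{\tau/n}(\cdot|y(\tau/n))\|_{TV},$

where $\Delta z(\tau) = z(\tau+1) - z(\tau)$ and $\mathcal{F}_\tau$ denotes the $\sigma$-algebra generated by $\{z(\sigma), z'(\sigma), \sigma \le \tau\}$.

Fixing $\epsilon < (1-\theta_c)/2$, let $\tau_* \le n$ denote the first value of $\tau$ such that $\|z(\tau) - n y(\tau/n)\| > K\sqrt{n\log n}$, with a finite $K = K(\epsilon)$ such that by parts (a) and (b) of Lemma 5.2, for any $n$ and $\rho \in [\epsilon, 1/\epsilon]$,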

Pn,p{T* < ?i(l - e)} < 

Fix $\beta \in (3/4, 1)$, $\beta'$ and $\delta > \beta - 1/2$. With at most $n$ values for $\tau$ in $J_n$ we thus obtain (5.18) once we show that for some $c < \infty$ and all $n$ large enough

(5.20) $\sup_{\tau\in J_n} P\{\tau < \tau_*,\ \|z(\tau) - z'(\tau)\| > cn^\delta\} \le n^{-2}.$

To this end, consider Doob's decomposition of the adapted process $N_s = (\tilde B_0^{s-1})^{-1}(z'(s) - z(s))$ as the sum of an $\mathcal{F}_s$-martingale $\{Z_s\}$, null at zero, and the predictable sequence

$V_\tau = \sum_{s=0}^{\tau-1} E[N_{s+1} - N_s \,|\, \mathcal{F}_s].$

It follows from our coupling that $\Delta V_s = (\tilde B_0^{s})^{-1}R(n^{-1}z(s), y(s/n), s/n)$, where

$R(x', x, \theta) = F(x,\theta) + \mathbf{1}_{\theta<\theta_n}A(x,\theta)[x' - x] - F(x',\theta)$

(with $\theta_n = \tau_n/n = \lfloor n\theta_c - n^\beta\rfloor/n$), and that for $\Delta_\tau^* = \Delta_\tau - \Delta z(\tau)$,

(5.21) $U_\tau = Z_{\tau+1} - Z_\tau = (\tilde B_0^{\tau})^{-1}\{\Delta_\tau^* - E[\Delta_\tau^* \,|\, \mathcal{F}_\tau]\}.$

Since $A_{ab}(x,\theta) = \partial_{x_b}F_a(x,\theta)$ with $F(x,\theta)$ having Lipschitz continuous derivatives on $q_+(\epsilon)$, it follows that $\|R(x',x,\theta)\| \le c_0\|x' - x\|^2$ for some $c_0 = c_0(\epsilon)$ finite, provided $\theta < \theta_n$ and both $(x,\theta)$ and $(x',\theta)$ are in $q_+(\epsilon)$. By the Lipschitz continuity of $F(x,\theta)$ we also have that $\|R(x',x,\theta)\| \le c_0\|x' - x\|$ in case $\theta \ge \theta_n$, as soon as $(x,\theta)$ and $(x',\theta)$ are in $q(\epsilon)$.

Recall Lemma 4.3 that for some finite $n_0$ and $\kappa$ we have that $\|(\tilde B_0^{\tau})^{-1}\| \le \kappa$ for all $\tau$, $\rho \in [\epsilon, 1/\epsilon]$ and $n \ge n_0$. In the course of proving part (a) of Proposition 4.2 we have seen that the distance of $(y(\theta,\rho),\theta)$ from the complement of $q(\epsilon)$ is bounded away from zero, uniformly in $\theta \le 1 - 2\epsilon$ and $\rho \in [\epsilon, 1/\epsilon]$. Further, $y_1(\theta,\rho) \ge \kappa' n^{2(\beta-1)}$ for some $\kappa' > 0$, all $n$, $|\rho - \rho_c| \le n^{\beta'-1}$, and $\theta \le \theta_n$ (cf. proof of Corollary 5.3). Consequently, for some finite $n_1 = n_1(K,\epsilon)$ and all $n \ge n_1$, the event $\{s < \tau_*\}$ implies that both $(n^{-1}z(s), s/n)$ and $(y(s/n), s/n)$ are in $q(\epsilon)$ when $s \le n\theta_c + n^\beta$, and in case $s < \tau_n$ they are also in $q_+(\epsilon)$. We deduce that if $n \ge n_1$ and $\{s < \tau_*\}$, then

$\|\Delta V_s\| \le c_0\|(\tilde B_0^{s})^{-1}\|\,\|n^{-1}z(s) - y(s/n)\|^2 \le c_0\kappa K^2 n^{-1}\log n,$

when $s < \tau_n$, whereas $\|\Delta V_s\| \le c_0\kappa K n^{-1/2}(\log n)^{1/2}$ for $s \in J_n$. Hence, for some finite $c_1$ and all $n \ge n_1$, the event $\{\tau < \tau_*\}$ implies for $\tau \in J_n$ that $\|V_\tau\| \le c_1 n^{\beta-1/2}(\log n)^{1/2}$. Fixing $\eta \in (1/4, \beta - 1/2)$, since $\|z'(\tau) - z(\tau)\| \le \|\tilde B_0^{\tau-1}\|[\|Z_\tau\| + \|V_\tau\|]$ and $\|\tilde B_0^{\tau-1}\|$ are bounded uniformly in $n$, $\tau$ and $\rho$, we thus get (5.20) by considering Lemma 5.6 at $t = \tau \in J_n$, and $a = n^\eta$, provided we show that for some $c_2$ finite, the martingale differences $U_\tau$ of (5.21) satisfy the inequality (5.19) with $\Gamma = \Gamma_n = c_2 n^{-1/2}(\log n)^{1/2}$ (as indeed $n^\eta \le \tau\Gamma_n\sqrt{d}$ for all $n$ large enough and $n^{2\eta}/(2d\tau\Gamma_n) \to \infty$). To this end, note first that by the total variation bound of Lemma 4.6 and the definition of $\tau_*$, for $\tau < \tau_*$ our coupling of $(z, z')$ results with

$P(\Delta_\tau^* \ne 0 \,|\, \mathcal{F}_\tau) \le L\|n^{-1}z(\tau) - y(\tau/n)\| \le LKn^{-1/2}(\log n)^{1/2} =: u_n.$

Further, the bounded support of $W_\theta(\cdot|\cdot)$ implies that $\|\Delta_\tau^*\| \le 4l$, so for $\tau < \tau_*$ also

$\|E[\Delta_\tau^* \,|\, \mathcal{F}_\tau]\| \le 4l\,P(\Delta_\tau^* \ne 0 \,|\, \mathcal{F}_\tau) \le 4l\min(u_n, 1).$

From the preceding estimates we deduce that $U_\tau$ of (5.21) is such that $\|U_\tau\| \le 8l\kappa$ and when $\tau < \tau_*$, also

$P(\|U_\tau\| > 4l\kappa\min(u_n, 1) \,|\, \mathcal{F}_\tau) \le P(\Delta_\tau^* \ne 0 \,|\, \mathcal{F}_\tau) \le u_n.$

These two facts easily imply that if $\|\lambda\| \le 1$ and $\tau < \tau_*$, then the inequality (5.19) holds for $\Gamma = 2(8l\kappa)^2 e^{8l\kappa}u_n$ which, as we have already seen, completes the proof of the proposition. □

6. Gaussian approximation and proof of Proposition 2.2. This section is devoted to the proof of Proposition 2.2. Specifically, building on Proposition 5.5, in Section 6.1 we approximate the Markov process $\{z'(\tau)\}$ of distribution $\tilde P_{n,\rho}(\cdot)$ by a Brownian motion with a quadratic shift, when $\tau$ is within the window $J_n$ around the critical time. Then, in Section 6.2 we show how the one-sided Brownian motion with quadratic shift can be replaced by a two-sided motion, once its initial condition is appropriately mapped to the distribution of the two-sided motion at the critical time. Finally, in Section 6.3 we show that this distribution [which a priori depends on the law of $z(0)$] is also well approximated by a Gaussian law and complete the proof of Proposition 2.2.

6.1. Local approximation by a Brownian motion with quadratic shift. Our goal here is to approximate the probabilities of interest to us in terms of the minimal value of the Brownian motion with quadratic shift

(6.1) $X_n(\tau) = n^{1/3}[X(n^{-2/3}(\tau - 0.5 - n\theta_c)) - X(n^{-2/3}(\tau_n - 0.5 - n\theta_c))],$

within $J_n = [n\theta_c - n^\beta, n\theta_c + n^\beta]$, for the process $\{X(t)\}$ of Proposition 2.2. As stated in the following lemma, while doing this we also approximate the law of $z_1(\tau_n)$ by that of the sum of $u_n\cdot[z(0) - n y(0)]$, where $u_n$ denotes the first row of $\tilde B_0^{\tau_n-1}$, and an independent normal random variable of mean $n y_1^*(\tau_n)$ and variance $n[(Q_{\tau_n})_{11} - u_n^\dagger Q(0,\rho)u_n]$.

Lemma 6.1. Fixing $\beta \in (3/4, 1)$ and $A > 0$, set $\beta' < 2\beta - 1$, $\epsilon_n = A\log n$ as in Proposition 2.1 and $Y_n = \inf_{t\in J_n} X_n(t)$ for $X_n(\cdot)$ of (6.1). Then, for any $\delta > 3\beta - 2$, there exist positive, finite constants $\alpha$ and $C$ such that for any $n$ and $|\rho - \rho_c| \le n^{\beta'-1}$,

$P\{\xi_n + \xi_n^* + Y_n \le -Cn^\delta\} - \frac{\alpha}{n} \le P_{n,\rho}\Big(\min_{\tau\in J_n} z_1(\tau) \le \pm\epsilon_n\Big) \le P\{\xi_n + \xi_n^* + Y_n \le Cn^\delta\} + \frac{\alpha}{n},$

where $\xi_n \equiv u_n\cdot[z(0) - n y(0)]$, the normal random variable $\xi_n^*$ of mean $n y_1^*(\tau_n)$ and variance $n[(Q_{\tau_n})_{11} - u_n^\dagger Q(0,\rho)u_n]$, and $X_n(\cdot)$, are mutually independent.

Proof. The strategy we follow is to progressively simplify the process $\{z(\tau)\}$ of distribution $P_{n,\rho}(\cdot)$ until we obtain the stated bounds of the lemma, where each simplification is justified by a coupling argument. The first and most important step of this program has already been done in Proposition 5.5. Since the chain $\{z'(\cdot)\}$ of law $\tilde P_{n,\rho}(\cdot)$ has independent increments for $\tau \ge \tau_n$, we can apply Sakhanenko's refinement of the Hungarian construction [22, 32] to the uniformly bounded (by $4l$) independent increments $\zeta_i = z_1'(\tau_n + i) - z_1'(\tau_n + i - 1)$. We then deduce the existence of a real-valued Gaussian process $b_n(\tau)$, independent of $z'(\tau_n)$, such that $b_n(\tau_n) = 0$; its independent increments $\Delta b_n(\tau) = b_n(\tau+1) - b_n(\tau)$ have mean and variance

$E\Delta b_n(\tau) = F_1(y(\tau/n,\rho),\tau/n), \qquad \mathrm{Var}\,\Delta b_n(\tau) = G_{11}(y(\tau/n,\rho),\tau/n)$

[matching the corresponding moments of $z_1'(\tau+1) - z_1'(\tau)$], such that for some finite $c_0$, $\alpha$ and all $n$, $\rho$,

(6.2) $P\Big\{\sup_{\tau\in J_n}|z_1'(\tau) - z_1'(\tau_n) - b_n(\tau)| \ge c_0\log n\Big\} \le \frac{\alpha}{n}$

(the latter follows by Chebyshev's inequality from [32]; see, e.g., [34], Theorem A).

Considering the representation (2.14) for $z'(\tau_n)$ we see that $z_1'(\tau_n) - \xi_n$ is the sum of the uniformly bounded real-valued independent variables $(\tilde B_{\sigma+1}^{\tau_n-1}\Delta_\sigma)_1$, $\sigma = 0, \ldots, \tau_n - 1$, plus a nonrandom constant. Hence, similarly to the derivation of (6.2) we obtain that for some finite $c_0$, $\alpha$ and all $n$, $\rho$,

(6.3) $P\{|z_1'(\tau_n) - \xi_n - \xi_n^*| \ge c_0\log n\} \le \frac{\alpha}{n},$

where $\xi_n^*$ is a normal random variable, independent of $\xi_n$ and $b_n(\cdot)$, whose mean and variance match those of $z_1'(\tau_n) - \xi_n$. It is not hard to verify that the latter mean and variance are indeed $n y_1^*(\tau_n)$ and $n[(Q_{\tau_n})_{11} - u_n^\dagger Q(0,\rho)u_n]$, as stated.

We clearly have the representation

$b_n(\tau) = \sum_{\sigma=\tau_n}^{\tau-1} E\Delta b_n(\sigma) + B\Big(\sum_{\sigma=\tau_n}^{\tau-1}\mathrm{Var}\,\Delta b_n(\sigma)\Big),$

for a standard Brownian motion $B(\cdot)$. Further, the real-valued Gaussian process $\{X_n(t), t \ge \tau_n\}$ of (6.1) admits the representation

$X_n(t) = \tilde F\int_{\tau_n-0.5}^{t-0.5}(\sigma/n - \theta_c)\,d\sigma + B(\tilde G(t - \tau_n)),$

for the same standard Brownian motion $B(\cdot)$, where $\tilde G = G_{11}$ and $\tilde F = \frac{d}{d\theta}F_1(y(\theta,\rho_c),\theta)$, both evaluated at $\theta = \theta_c$ and $y = y(\theta_c,\rho_c)$ [so $\tilde F$ is as defined in (2.7)]. Combining (5.18), (6.2) and (6.3) we establish the thesis of the lemma upon showing that the preceding coupling of $b_n(\cdot)$ and $X_n(\cdot)$ is such that for some $\alpha$, $c_1$ finite and all $n$,

(6.4) $P\Big\{\sup_{t\in J_n}|b_n([t]) - X_n(t)| \ge 3c_1 n^\delta\Big\} \le \frac{\alpha}{n}.$

The sup in (6.4) is taken over all real-valued $t \in J_n = [n\theta_c - n^\beta, n\theta_c + n^\beta]$, while in the sequel we use $\tau \in J_n$ to denote an integer in the same interval.

With $\{X_n(\tau + t) - X_n(\tau) : t \in [0,1]\}$ having the same law as $\{B(\tilde G t) + a_{n,\tau}(t) : t \in [0,1]\}$ for nonrandom $a_{n,\tau}(t)$ which are bounded uniformly in $t \in [0,1]$, $n$ and $\tau \in J_n$, we obviously get (6.4) upon showing that

(6.5) $\sup_{\tau\in J_n} P\{|b_n(\tau) - X_n(\tau)| \ge 2c_1 n^\delta\} \le n^{-2}.$

Inequality (6.5) is a direct consequence of having a finite $\kappa$ such that, for $\Delta X_n(\tau) = X_n(\tau+1) - X_n(\tau)$,

$e_1(\tau) = |\mathrm{Var}\,\Delta b_n(\tau) - \mathrm{Var}\,\Delta X_n(\tau)| = |G_{11}(y(\tau/n,\rho),\tau/n) - \tilde G| \le \kappa n^{\beta-1},$

$e_2(\tau) = |E\Delta b_n(\tau) - E\Delta X_n(\tau)| = |F_1(y(\tau/n,\rho),\tau/n) - (\tau/n - \theta_c)\tilde F| \le \kappa n^{2(\beta-1)},$

for all $\tau \in J_n$ and $|\rho - \rho_c| \le n^{\beta'-1}$. Indeed, since $\delta > \beta + 2(\beta-1) > 1 - \beta$ and the interval $J_n$ is of length $2\lfloor n^\beta\rfloor$, taking $c_1$ large enough so $c_1 n^\delta \ge \kappa n^{2(\beta-1)}|J_n|$ for all $n$, the stated bound on $e_2(\cdot)$ guarantees that $|E b_n(\tau) - E X_n(\tau)| \le c_1 n^\delta$ for all $\tau \in J_n$, whereas the corresponding bound on $e_1(\cdot)$ guarantees that $\mathrm{Var}(b_n(\tau) - X_n(\tau)) \le c_1 n^\delta n^{1-\beta}$, leading (by standard Gaussian tail estimates) to (6.5).
Turning to bound ei(T) and 62 (r), recall that r G J„ and \p — Pc\ <n^ ~^ 
imply that {r/n^p) G [0,1-e) x [e, 1/e], so \y{T /n, p) - y{T /n, pc)\ <Ci\p- 
Pc\ for some constant Ci = Ci(e) by the Lipschitz continuity of pi— > y{9,p) 
[see part (a) of Proposition 4.2]. Further, then {y{T/n,p),T/n) G g(e), so by 
Lemma 4.1 we have the Lipschitz continuity of p ^ Fi{y{T/n, p),T/n) and 
P^ Gii{y{T / n, p) ,T / n) . That is, for some constant C2 = C'2(e) and all such 
T, p, n, 

(6.6) ei(r) < \Gii{y{T /n, pc),T /n) - Gu{y{9c, Pc),9c)\ +C2|p-pc|, 

(6.7) e2(r) < |Fi (r/n) - (r/n - 9,)F\ + C2\p - Pc|, 

where Fi{9) = Fi{y{9, pc),9). Similarly, the Lipschitz continuity of 1— > y{9, pc) 
on [0, 1 — e) [from part (a) of Proposition 4.2] together with that of {x,9) 1— > 
Gii{x,9) on q{e) (by Lemma 4.1) result in 

\Guiy{r/n,pc),r/n) - Gniy{9c,Pc),ec)\ <C3\T/n - 9c\ <C3n^-\ 
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for some C3 = Cs^e) and all t £ Jn- Thus, with /?' — 1 < 2(/3 — 1), we get 
from (6.6) that ei(r) < Kn^~^ for all t £ Jn and \p — pc\ <n^ as stated. 
As for bounding $e_2(\tau)$, recall that $\theta \mapsto y(\theta,\rho_c)$ is infinitely continuously differentiable on $[0,1-\epsilon]$ [cf. parts (b) and (d) of Proposition 4.2]. Further, as $(y(\theta,\rho_c),\theta) \in q(\epsilon)$ for all $\theta \in [0,1-\epsilon]$ [by (a) and (d) of Proposition 4.2], from Lemma 4.1 we have that $\bar F_1(\cdot)$ is differentiable on $[0,1-\epsilon]$ with a Lipschitz continuous derivative. Recall that $dy_1/d\theta = 0$ at $\theta = \theta_c$ and $\rho = \rho_c$ [see part (d) of Proposition 4.2]. Hence, $\bar F_1(\theta_c) = 0$ [in view of the ODE (2.5)], and with $\tilde F = \bar F_1'(\theta_c)$ we deduce that for some $C_4 = C_4(\epsilon)$ and all $\theta \in [0,1-\epsilon]$,

$|\bar F_1(\theta) - (\theta-\theta_c)\tilde F| \le C_4|\theta-\theta_c|^2$.

Combining (6.7) with the latter bound (for $\theta = \tau/n$ and $\tau \in J_n$, so $|\theta-\theta_c| \le n^{\beta-1}$), we conclude that $e_2(\tau) \le K n^{2(\beta-1)}$ for all $\tau \in J_n$ and $|\rho-\rho_c| \le n^{\beta'-1}$, as stated. □
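
The quadratic bound used at the end of this proof is a first-order Taylor estimate. A sketch of that step follows, writing $\bar F_1$ for the function $\theta \mapsto F_1(y(\theta,\rho_c),\theta)$ and $L$ for a Lipschitz constant of its derivative on $[0,1-\epsilon]$; the symbol $L$ is introduced here for illustration only and does not appear in the argument above.

```latex
% Uses only \bar F_1(\theta_c) = 0 and the Lipschitz continuity of \bar F_1'.
\begin{align*}
\bar F_1(\theta) - (\theta-\theta_c)\,\bar F_1'(\theta_c)
  &= \int_{\theta_c}^{\theta} \bigl[\bar F_1'(u) - \bar F_1'(\theta_c)\bigr]\,du , \\
\bigl|\bar F_1(\theta) - (\theta-\theta_c)\,\bar F_1'(\theta_c)\bigr|
  &\le \Bigl|\int_{\theta_c}^{\theta} L\,|u-\theta_c|\,du\Bigr|
   = \tfrac{L}{2}\,|\theta-\theta_c|^{2} .
\end{align*}
```

Under these assumptions one may take $C_4 = L/2$.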



6.2. Brownian computations. We show in the sequel that for large $s$ and $u$ the distribution of

$V_{s,u} = \inf_{t \in [-s,u]} X(t) - X(-s)$

is well approximated by that of $V_* - X(-s)$ for $V_* = \inf_t \tilde X(t)$ and $\{\tilde X(t)\}$ an independent copy of $\{X(t)\}$. More precisely, we prove:

Lemma 6.2. With the preceding definitions, for $0 < \varphi < 4(1-\psi)/3 - 1$, all $s$, $u$ large enough and any nonrandom $v$,

$P\{V_* - X(-s) \ge v + 2s^{-\psi}\} - 5e^{-(s\wedge u)^{\varphi}} \le P\{V_{s,u} \ge v\}$

$\le P\{V_* - X(-s) \ge v - 2s^{-\psi}\} + 5e^{-(s\wedge u)^{\varphi}}$.

Proof. Conditioning upon the value of $X(-s)$ we have on account of the independence of $\{X(-t) : t \ge 0\}$ and $\{X(t) : t \ge 0\}$ that

$P\{V_{s,u} \ge v\} = E[p_{-s,0}(v + X(-s), X(-s))\, q_{-u,0}(v + X(-s))]$,

where for $s > \theta \ge 0$ and any $a$, $b$,

$p_{-s,-\theta}(a,b) = P\bigl(\inf_{-s \le t \le -\theta} X(t) \ge a \,\big|\, X(-s) = b\bigr)$,

$q_{-s,-\theta}(a) = P\bigl(\inf_{-s \le t \le -\theta} X(t) \ge a\bigr)$.

Recall that the law of $\{X(t) : -s \le t \le 0\}$ conditional upon $\{X(-s) = b\}$ is merely the law of $\{X_{b,s}(t) = X(t) - \frac{t}{s}(b - X(-s)) : -s \le t \le 0\}$. Thus, in particular,

(6.8) $P\{V_{s,u} \ge v\} = E[\hat p_{-s,0}(v + X(-s), X(-s))\, q_{-u,0}(v + X(-s))]$,

where

$\hat p_{-s,-\theta}(a,b) = P\bigl(\inf_{-s \le t \le -\theta} X_{b,s}(t) \ge a\bigr)$.

Similarly,

(6.9) $P\{V_* - X(-s) \ge v\} = E[q_{-\infty,0}(v + X(-s))^2]$.

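Fixing $0 < \varphi < 4(1-\psi)/3 - 1$, choose $(\varphi+1)/2 < \kappa < 2(1-\psi)/3$. Then, setting $\rho = 1 - \psi - \kappa$ and $\theta = s^\rho$ (so $s^{-\psi} = \frac{\theta}{s} s^\kappa$), it follows that if $|b - \frac12\tilde F s^2| \le s^\kappa$, then for all $s$ large enough

(6.10) $P\bigl(\sup_{-\theta \le t \le 0} |X_{b,s}(t) - X(t)| > 2s^{-\psi}\bigr) \le P\{|X(-s) - \frac12\tilde F s^2| > s^\kappa\} \le e^{-s^\varphi}$.

Consequently, for any value of $a$,

(6.11) $q_{-\theta,0}(a + 2s^{-\psi}) - e^{-s^\varphi} \le \hat p_{-\theta,0}(a,b) \le q_{-\theta,0}(a - 2s^{-\psi}) + e^{-s^\varphi}$.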

Relying upon these bounds we next show that if $|b - \frac12\tilde F s^2| \le s^\kappa$, then for $s$ large enough and all $a$,

(6.12) $q_{-\infty,0}(a + 2s^{-\psi}) - 3e^{-s^\varphi} \le \hat p_{-s,0}(a,b) \le q_{-\infty,0}(a - 2s^{-\psi}) + 3e^{-s^\varphi}$.

Indeed, with $X_{b,s}(0) = X(0) = 0$, clearly (6.12) holds for $a > 0$ [as then $q_{-\infty,0}(a) = \hat p_{-s,0}(a,b) = 0$]. Next recall that for $s > \theta > 0$ and any $a$, $b$,

(6.13) $\hat p_{-\theta,0}(a,b) - [1 - \hat p_{-s,-\theta}(a,b)] \le \hat p_{-s,0}(a,b) \le \hat p_{-\theta,0}(a,b)$,

(6.14) $q_{-\infty,0}(a) \le q_{-\theta,0}(a) \le q_{-\infty,0}(a) + [1 - q_{-\infty,-\theta}(a)]$.

Combining these with the monotonicity in $a$ of the functions $\hat p$ and $q$, we thus get (6.12) also for $a \le 0$ out of (6.11) as soon as we show that for $\theta = s^\rho$ and all large $s$

(6.15) $1 - q_{-\infty,-\theta}(2s^\kappa) = P\bigl(\inf_{t \le -\theta} X(t) < 2s^\kappa\bigr) \le e^{-s^\varphi}$,

(6.16) $1 - \hat p_{-s,-\theta}(0,b) = P\bigl(\inf_{-s \le t \le -\theta} X_{b,s}(t) < 0\bigr) \le 2e^{-s^\varphi}$.

Now, since $2\rho > \kappa > \varphi$, it follows by standard Gaussian tail estimates that for $\theta = s^\rho$ and $s$ large
inf X(t) < 2s'^\ 
<-s J 

£ [p|^W^(r)<-|r2| 

+ P| V^^inf jH^(r + t)- W{t)] < -|r 



t< 

oo 

< 

r=L9J 



thus establishing (6.15). Further, as $|X_{b,s}(t) - X(t)| \le |X(-s) - \frac12\tilde F s^2| + |b - \frac12\tilde F s^2|$, we deduce from (6.10) that if $|b - \frac12\tilde F s^2| \le s^\kappa$, then

$P\bigl(\sup_{-s \le t \le 0} |X_{b,s}(t) - X(t)| > 2s^\kappa\bigr) \le e^{-s^\varphi}$,

which together with (6.15) implies the bound (6.16).

We now apply in (6.8) standard Gaussian tail estimates for $|X(-s) - \frac12\tilde F s^2| > s^\kappa$, and the bounds of (6.12) otherwise. With the $[0,1]$-valued $\hat p_{-s,0}(a,b)$ and $q_{-s,0}(a)$ monotone in $s$ and $a$, this results in

(6.17) $E[q_{-\infty,0}(v + X(-s) + 2s^{-\psi})^2] - 4e^{-s^\varphi} \le P\{V_{s,u} \ge v\}$

$\le E[q_{-\infty,0}(v + X(-s) - 2s^{-\psi})\, q_{-u,0}(v + X(-s) - 2s^{-\psi})] + 4e^{-s^\varphi}$.

Finally, if $a > 0$, then $q_{-u,0}(a) = q_{-\infty,0}(a) = 0$, whereas for $a \le 0$, taking $\theta = u$ in (6.14) we find by (6.15) and the monotonicity of $q_{-\infty,-u}(a)$ that $q_{-u,0}(a) \le q_{-\infty,0}(a) + \exp(-u^\varphi)$ for $u$ large. Combining this upper bound on $q_{-u,0}$ with (6.9) and (6.17) provides the thesis of the lemma. □
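
The comparison in Lemma 6.2 can be explored numerically. The following is a minimal Monte Carlo sketch, not part of the proof. It assumes the explicit form $X(t) = \frac12\tilde F t^2 + \sqrt{\tilde G}\,W(t)$ with $W$ a two-sided Brownian motion (Proposition 2.2 is not restated in this section, so this form is an assumption here), uses the placeholder values $\tilde F = \tilde G = 1$, discretizes time with a mesh `dt`, and truncates the infimum defining $V_*$ to a finite window `horizon`. All function names are hypothetical.

```python
import math
import random


def infimum_and_left_value(s, u, dt, f_tilde, g_tilde, rng):
    """Return (grid minimum of X over [-s, u], X(-s)) for one sampled path.

    Assumed model: X(t) = 0.5*f_tilde*t**2 + sqrt(g_tilde)*W(t), where W is a
    two-sided Brownian motion with W(0) = 0.
    """
    step_sd = math.sqrt(g_tilde * dt)

    def half_path(length):
        # X at |t| = 0, dt, 2*dt, ...; the two half-lines use independent
        # increments and the quadratic drift is even in t.
        w, values = 0.0, [0.0]
        for k in range(1, int(round(length / dt)) + 1):
            w += rng.gauss(0.0, step_sd)
            values.append(0.5 * f_tilde * (k * dt) ** 2 + w)
        return values

    left, right = half_path(s), half_path(u)
    return min(min(left), min(right)), left[-1]


def sample_v_su(s, u, dt=0.01, f_tilde=1.0, g_tilde=1.0, rng=None):
    """One draw of V_{s,u} = inf over [-s, u] of X(t), minus X(-s)."""
    rng = rng or random.Random()
    low, x_left = infimum_and_left_value(s, u, dt, f_tilde, g_tilde, rng)
    return low - x_left


def sample_v_star_minus_x(s, horizon, dt=0.01, f_tilde=1.0, g_tilde=1.0, rng=None):
    """One draw of V_* - X(-s), with V_* taken from an independent path.

    The infimum over the whole line is truncated to [-horizon, horizon];
    X(-s) is drawn directly from its normal law under the assumed model.
    """
    rng = rng or random.Random()
    v_star, _ = infimum_and_left_value(horizon, horizon, dt, f_tilde, g_tilde, rng)
    x_left = rng.gauss(0.5 * f_tilde * s ** 2, math.sqrt(g_tilde * s))
    return v_star - x_left


def tail(samples, v):
    """Empirical P(sample >= v)."""
    return sum(x >= v for x in samples) / len(samples)


rng = random.Random(2024)
s = u = 4.0
a = [sample_v_su(s, u, rng=rng) for _ in range(400)]
b = [sample_v_star_minus_x(s, horizon=4.0, rng=rng) for _ in range(400)]
for v in (-10.0, -8.0, -6.0):
    print(v, tail(a, v), tail(b, v))
```

The two printed empirical tails should be close for moderately large `s` and `u`, up to Monte Carlo and discretization error; the sketch does not attempt to reproduce the explicit error terms of the lemma.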

Since $Y_n$ of Lemma 6.1 has the same law as $n^{1/3}V_{s,u}$ for $s = s(n) = n^{-2/3}[n\theta_c + 0.5 - \tau_n]$ and $u = u(n) = n^{-2/3}[n^\beta - 0.5]$, we have the following immediate corollary of Lemmas 6.1 and 6.2.

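Corollary 6.3. Fixing $\beta \in (3/4,1)$ and $A > 0$, set $J_n = [n\theta_c - n^\beta, n\theta_c + n^\beta]$ and $\beta' < 2\beta - 1$, $\epsilon_n = A\log n$ as in Proposition 2.1. Let $\{\tilde X(t)\}$ denote an i.i.d. copy of the process $\{X(t)\}$ of Proposition 2.2. Then, for any $\nu < \min(7/3 - 3\beta, \beta/4 - 1/6)$, there exists a finite $c$ such that for all $n$ and $|\rho - \rho_c| \le n^{\beta'-1}$,

$P\bigl(\xi_n + \inf_t \tilde X(t) \le -n^{-\nu}\bigr) - \frac{c}{n} \le P_{n,\rho}\bigl(\min_{\tau \in J_n} z_1(\tau) \le \pm\epsilon_n\bigr)$

$\le P\bigl(\xi_n + \inf_t \tilde X(t) \le n^{-\nu}\bigr) + \frac{c}{n}$,

where $\xi_n = n^{-1/3}(\xi_n^* + \zeta_n) - X(-n^{\beta-2/3})$ [and $\zeta_n$ and $\xi_n^*$ of Lemma 6.1 are independent of both $\{X(\cdot)\}$ and $\{\tilde X(\cdot)\}$].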

Proof. Fixing $\nu < \min(7/3 - 3\beta, \beta/4 - 1/6)$, set $\delta > 3\beta - 2$ of Lemma 6.1 and $\psi \in (0,1/4)$ of Lemma 6.2 such that $\nu < 1/3 - \delta$ and $\nu < \psi(\beta - 2/3)$. Conditioning on the values of $\zeta_n$ and $\xi_n^*$ we apply Lemma 6.2 for the values of $s = s(n)$ and $u = u(n)$ indicated above, taking there $v(n) = n^{-1/3}[\pm Cn^\delta - \xi_n^* - \zeta_n]$ for the finite constant $C$ of Lemma 6.1. With $s(n) \wedge u(n) \ge n^{\beta-2/3} - 2$ and $\beta > 2/3$, the error terms $5\exp(-(s(n) \wedge u(n))^\varphi)$ are accommodated within $c/(2n)$ for some finite $c$ and all $n$. Further, enlarging $c$ if needed, with $|s(n) - n^{\beta-2/3}| \le 2n^{-2/3}$ and $\nu < \psi(\beta - 2/3) < 1/3$, it is easy to see that for all $n$,

$P\bigl\{|X(-s(n)) - X(-n^{\beta-2/3})| > \tfrac12 n^{-\nu}\bigr\} \le \frac{c}{2n}$.

Our choice of $\psi$ and $\delta$ is such that $Cn^{\delta-1/3} + 2s(n)^{-\psi} \le \tfrac12 n^{-\nu}$ for all $n \ge n_0$, so adding to $c$ the constant $q$ of Lemma 6.1 and making sure that $c \ge 2n_0$, upon taking the expectation over $\xi_n^*$ and $\zeta_n$ our thesis follows from the latter lemma. □



6.3. Proof of Proposition 2.2. Fixing $\beta \in (3/4,1)$, $r \in \mathbb{R}$ and $\beta' < 2\beta - 1$, we have that $|\rho_n - \rho_c| \le n^{\beta'-1}$ for $\rho_n = \rho_c + rn^{-1/2}$ and all $n$ large enough. Further, taking $\beta = 10/13 \in (3/4,1)$ which maximizes the bound $\nu_0 = \min(5/2 - 3\beta, \beta/4) - 1/6$ on $\nu$ in Corollary 6.3 leads to $\nu_0 = 5/26 - 1/6 > 0$. Thus, fixing $A > 0$, the statement (2.11) of the proposition is a consequence of Corollary 6.3, once we show that for any $1/6 < \eta < \nu + 1/6 < 5/26$ and all $n$ large enough,



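$\bigl|P\bigl(\xi_n + \inf_t \tilde X(t) \le \pm n^{-\nu}\bigr) - P\bigl(n^{1/6}\xi(r) + \inf_t \tilde X(t) \le 0\bigr)\bigr| \le n^{-\eta}$,

where $\xi(r)$ denotes a normal random variable of mean $(\partial y_1/\partial\rho)\, r$ and variance $Q_{11}$ (both evaluated at $\theta = \theta_c$ and $\rho = \rho_c$), independent of $\tilde X(\cdot)$. Conditioning on $\xi_n$ and $\xi(r)$, by the independence of $\{\tilde X(t) : t \ge 0\}$ and $\{\tilde X(t) : t \le 0\}$, this is equivalent to

(6.18) $\bigl|E[q(\pm n^{-\nu} - \xi_n)^2] - E[q(-n^{1/6}\xi(r))^2]\bigr| \le n^{-\eta}$,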
where $q(a) = P(\inf_{t \le 0} \tilde X(t) \ge a)$. With $q^2(a)$ a $[0,1]$-valued monotone non-increasing function that approaches zero as $a \to \infty$, we have that for any random variables $Y$, $Z$ and nonrandom $v$,

$\bigl|E[q(vZ)^2] - E[q(vY)^2]\bigr| \le \sup_x |P(Z \le x) - P(Y \le x)|$.
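
The inequality just stated can be checked exactly on small examples. The sketch below does so with rational arithmetic for two finitely supported laws. The function `toy_q` is a hypothetical stand-in for $q$ (non-increasing, $[0,1]$-valued, zero for $a \ge 0$), not the actual Brownian functional, and the two laws are arbitrary illustrative choices.

```python
from fractions import Fraction


def kolmogorov_distance(law_z, law_y):
    """sup_x |P(Z <= x) - P(Y <= x)| for finitely supported laws {value: prob}."""
    cdf_z = cdf_y = Fraction(0)
    worst = Fraction(0)
    # Both distribution functions are step functions, so the supremum is
    # attained at one of the support points.
    for x in sorted(set(law_z) | set(law_y)):
        cdf_z += law_z.get(x, Fraction(0))
        cdf_y += law_y.get(x, Fraction(0))
        worst = max(worst, abs(cdf_z - cdf_y))
    return worst


def toy_q(a):
    """Hypothetical stand-in for q: non-increasing, [0, 1]-valued, zero for a >= 0."""
    return min(Fraction(1), max(Fraction(0), -a))


def expectation_gap(law_z, law_y, v):
    """|E[q(vZ)^2] - E[q(vY)^2]| computed with the toy q above."""
    def mean(law):
        return sum(p * toy_q(v * x) ** 2 for x, p in law.items())
    return abs(mean(law_z) - mean(law_y))


half = Fraction(1, 2)
law_z = {Fraction(-2): half, Fraction(0): half}
law_y = {Fraction(-1, 2): half, Fraction(1): half}

bound = kolmogorov_distance(law_z, law_y)
gaps = {v: expectation_gap(law_z, law_y, Fraction(v)) for v in (-3, -1, 0, 1, 2)}
assert all(gap <= bound for gap in gaps.values())
```

In this example the bound is attained for $v = -1$, which illustrates that no constant smaller than one can multiply the right-hand side in general.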

Applying this for $v = -n^{1/6}$, $Z = n^{-1/6}\xi_n \pm n^{-(\nu+1/6)}$ and $Y = \xi(r)$ of bounded density, we deduce that (6.18) holds for all $n$ large enough, thus completing the proof of Proposition 2.2 as soon as we show that for $\eta < 5/2 - 3\beta$ and all $n$ large enough,

(6.19) $\sup_x |P(n^{-1/6}\xi_n \le x) - P(\xi(r) \le x)| \le n^{-\eta}$.

To this end, recall that $n^{-1/6}\xi_n = n^{-1/2}\xi_n^* + n^{-1/2}\zeta_n - n^{-1/6}X(-n^{\beta-2/3})$, where the latter three random variables are independent of each other. Hence, in view of (4.20) we have that

sup|P(n^^/^^n <x)- P(Cn < 2;)| < K3n^i/2 

where is obtained upon replacing ^„ with a normal random variable of zero 
mean and variance nu]^Q(0, p„)'Un [for the positive definite initial condition 

Q{0,p) of the ODE (2.8) at p = With ECn = n^^^yt{Tn) - ^n^^-^l'^, it 
follows from (4.9) that |ECn - 'Ei(j)\ < Cn^^"^/^. Similarly, it follows from 
(4.10) that for some C finite and all n, 

|Var(C„) - Var(^(r))| = |(Q,Jn + Gn'^-i - Qn(0c,Pc)| 

With Var(^(r)) > independent of n, our thesis (6.19) easily follows from 
these bounds on the difference in the mean and variance of the normal 
random variables C„ and ^(r). 
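
The choice $\beta = 10/13$ made at the start of this proof can be verified by exact arithmetic. The short sketch below locates the point where the two branches of $\min(5/2 - 3\beta, \beta/4)$ cross and confirms on a rational grid that no other $\beta \in (3/4,1)$ gives a larger value of $\nu_0$. The function name `nu0` and the grid resolution are illustrative choices.

```python
from fractions import Fraction


def nu0(beta):
    """The bound nu_0(beta) = min(5/2 - 3*beta, beta/4) - 1/6 on nu."""
    return min(Fraction(5, 2) - 3 * beta, beta / 4) - Fraction(1, 6)


# The decreasing branch 5/2 - 3*beta meets the increasing branch beta/4
# where 5/2 = (3 + 1/4) * beta.
beta_star = Fraction(5, 2) / (3 + Fraction(1, 4))
best = nu0(beta_star)

# Brute-force comparison on a rational grid strictly inside (3/4, 1).
grid = [Fraction(3, 4) + Fraction(k, 4000) for k in range(1, 1000)]
assert all(nu0(b) <= best for b in grid)

print(beta_star, best)  # 10/13 1/39
```

The value $1/39$ equals $5/26 - 1/6$, in agreement with the exponent quoted in the proof.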